TY - JOUR
T1 - Multi-Task Probabilistic Regression With Overlap Maximization for Visual Tracking
AU - Feng, Zihang
AU - Yan, Liping
AU - Xia, Yuanqing
AU - Xiao, Bo
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2023/12/1
Y1 - 2023/12/1
N2 - Recent researches made a breakthrough in visual tracking accuracy. Many trackers benefit from the object state representations and network loss functions, which mine the output space and improve the power of supervision, respectively. Probabilistic regression method models the noises and uncertainties in the annotations. However, advanced trackers with probabilistic regression are not studied sufficiently in the aspect of supervision and the aspect of robustness of evaluation maximization. In this paper, an overlap maximization network in the manner of probabilistic regression is proposed to improve the learning ability of the network and the discriminative ability in the evaluation maximization. Firstly, the probabilistic regression is extended with the intersection over union (IoU) evaluation, which is normalized as a probability density in the regression space. Secondly, the classification probability is added as a branch of the iterative evaluation module to improve the ability of distinguishing objects in the evaluation maximization. Moreover, the two branches are constructed into a joint probabilistic regression task of IoU evaluation, which makes the network learn from two types of ground truth and provide a consistent result with multi-branch outputs. For feature interpretation, the strip pooling network and the space-time memory network are introduced to encode long-range context and provide dynamic features, respectively. Compared to the state-of-the-art probabilistic regression trackers and other advanced trackers, the experiments show that the proposed tracker achieves outstanding performance across the six datasets, including GOT-10k, LaSOT, TrackingNet, UAV123, OTB-100 and VOT2018.
AB - Recent researches made a breakthrough in visual tracking accuracy. Many trackers benefit from the object state representations and network loss functions, which mine the output space and improve the power of supervision, respectively. Probabilistic regression method models the noises and uncertainties in the annotations. However, advanced trackers with probabilistic regression are not studied sufficiently in the aspect of supervision and the aspect of robustness of evaluation maximization. In this paper, an overlap maximization network in the manner of probabilistic regression is proposed to improve the learning ability of the network and the discriminative ability in the evaluation maximization. Firstly, the probabilistic regression is extended with the intersection over union (IoU) evaluation, which is normalized as a probability density in the regression space. Secondly, the classification probability is added as a branch of the iterative evaluation module to improve the ability of distinguishing objects in the evaluation maximization. Moreover, the two branches are constructed into a joint probabilistic regression task of IoU evaluation, which makes the network learn from two types of ground truth and provide a consistent result with multi-branch outputs. For feature interpretation, the strip pooling network and the space-time memory network are introduced to encode long-range context and provide dynamic features, respectively. Compared to the state-of-the-art probabilistic regression trackers and other advanced trackers, the experiments show that the proposed tracker achieves outstanding performance across the six datasets, including GOT-10k, LaSOT, TrackingNet, UAV123, OTB-100 and VOT2018.
KW - Siamese network
KW - Visual tracking
KW - overlap maximization
KW - probabilistic regression
UR - http://www.scopus.com/inward/record.url?scp=85162854640&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2023.3275573
DO - 10.1109/TCSVT.2023.3275573
M3 - Article
AN - SCOPUS:85162854640
SN - 1051-8215
VL - 33
SP - 7554
EP - 7564
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 12
ER -