TY - JOUR
T1 - Global relative position space based pooling for fine-grained vehicle recognition
AU - Xiang, Ye
AU - Fu, Ying
AU - Huang, Hua
N1 - Publisher Copyright:
© 2019
PY - 2019/11/20
Y1 - 2019/11/20
N2 - In the field of fine-grained vehicle recognition, large intra-class variation often affects the final prediction seriously. One of the principal reasons is unstable change of absolute positions for discriminative features, which can not be completely solved by the region of interest (ROI) localization for inevitable deviations. In this paper, we propose a global relative position space based pooling (GRPSP) method to replace the absolute position information with global relative position information, by which the discriminative features can be well aligned and thereby the intra-class variation is reduced effectively. Specifically, the discriminative features containing both absolute position information and appearance information are extracted by a convolution neural network (CNN). We then transform the absolute positions into global relative positions, and estimate position probabilities of locating at discrete representatives of global relative position space. Accordingly, the joint probabilities of different kinds of features locating at different representative positions can be calculated, and constitute the pooling vector. Lastly, the Support Vector Machine (SVM) is trained for classification. Experimental results on public and self-building challenging datasets both show that our proposed method can improve various CNNs and is the state-of-the-art.
AB - In the field of fine-grained vehicle recognition, large intra-class variation often affects the final prediction seriously. One of the principal reasons is unstable change of absolute positions for discriminative features, which can not be completely solved by the region of interest (ROI) localization for inevitable deviations. In this paper, we propose a global relative position space based pooling (GRPSP) method to replace the absolute position information with global relative position information, by which the discriminative features can be well aligned and thereby the intra-class variation is reduced effectively. Specifically, the discriminative features containing both absolute position information and appearance information are extracted by a convolution neural network (CNN). We then transform the absolute positions into global relative positions, and estimate position probabilities of locating at discrete representatives of global relative position space. Accordingly, the joint probabilities of different kinds of features locating at different representative positions can be calculated, and constitute the pooling vector. Lastly, the Support Vector Machine (SVM) is trained for classification. Experimental results on public and self-building challenging datasets both show that our proposed method can improve various CNNs and is the state-of-the-art.
KW - Convolution neural network (CNN)
KW - Fine-grained vehicle recognition
KW - Global relative position space based pooling (GRPSP)
UR - http://www.scopus.com/inward/record.url?scp=85070884787&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2019.07.098
DO - 10.1016/j.neucom.2019.07.098
M3 - Article
AN - SCOPUS:85070884787
SN - 0925-2312
VL - 367
SP - 287
EP - 298
JO - Neurocomputing
JF - Neurocomputing
ER -