TY - JOUR
T1 - Group-sensitive triplet embedding for vehicle reidentification
AU - Bai, Yan
AU - Lou, Yihang
AU - Gao, Feng
AU - Wang, Shiqi
AU - Wu, Yuwei
AU - Duan, Ling Yu
N1 - Publisher Copyright:
© 1999-2012 IEEE.
PY - 2018/9
Y1 - 2018/9
N2 - The widespread use of surveillance cameras toward smart and safe cities poses the critical but challenging problem of vehicle reidentification (Re-ID). The state-of-the-art research work performed vehicle Re-ID relying on deep metric learning with a triplet network. However, most existing methods basically ignore the impact of intraclass variance-incorporated embedding on the performance of vehicle reidentification, in which robust fine-grained features for large-scale vehicle Re-ID have not been fully studied. In this paper, we propose a deep metric learning method, group-sensitive-triplet embedding (GS-TRE), to recognize and retrieve vehicles, in which intraclass variance is elegantly modeled by incorporating an intermediate representation 'group' between samples and each individual vehicle in the triplet network learning. To capture the intraclass variance attributes of each individual vehicle, we utilize an online grouping method to partition samples within each vehicle ID into a few groups, and build up the triplet samples at multiple granularities across different vehicle IDs as well as different groups within the same vehicle ID to learn fine-grained features. In particular, we construct a large-scale vehicle database 'PKU-Vehicle,' consisting of 10 million vehicle images captured by different surveillance cameras in several cities, to evaluate the vehicle Re-ID performance in real-world video surveillance applications. Extensive experiments over benchmark datasets VehicleID, VeRI, and CompCar have shown that the proposed GS-TRE significantly outperforms the state-of-the-art approaches for vehicle Re-ID.
AB - The widespread use of surveillance cameras toward smart and safe cities poses the critical but challenging problem of vehicle reidentification (Re-ID). The state-of-the-art research work performed vehicle Re-ID relying on deep metric learning with a triplet network. However, most existing methods basically ignore the impact of intraclass variance-incorporated embedding on the performance of vehicle reidentification, in which robust fine-grained features for large-scale vehicle Re-ID have not been fully studied. In this paper, we propose a deep metric learning method, group-sensitive-triplet embedding (GS-TRE), to recognize and retrieve vehicles, in which intraclass variance is elegantly modeled by incorporating an intermediate representation 'group' between samples and each individual vehicle in the triplet network learning. To capture the intraclass variance attributes of each individual vehicle, we utilize an online grouping method to partition samples within each vehicle ID into a few groups, and build up the triplet samples at multiple granularities across different vehicle IDs as well as different groups within the same vehicle ID to learn fine-grained features. In particular, we construct a large-scale vehicle database 'PKU-Vehicle,' consisting of 10 million vehicle images captured by different surveillance cameras in several cities, to evaluate the vehicle Re-ID performance in real-world video surveillance applications. Extensive experiments over benchmark datasets VehicleID, VeRI, and CompCar have shown that the proposed GS-TRE significantly outperforms the state-of-the-art approaches for vehicle Re-ID.
KW - Vehicle re-identification
KW - embedding
KW - metric learning, intra-class variance
KW - retrieval
KW - surveillance
UR - http://www.scopus.com/inward/record.url?scp=85040947979&partnerID=8YFLogxK
U2 - 10.1109/TMM.2018.2796240
DO - 10.1109/TMM.2018.2796240
M3 - Article
AN - SCOPUS:85040947979
SN - 1520-9210
VL - 20
SP - 2385
EP - 2399
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
IS - 9
M1 - 8265213
ER -