TY - JOUR
T1 - Visual Relationship Detection
T2 - A Survey
AU - Cheng, Jun
AU - Wang, Lei
AU - Wu, Jiaji
AU - Hu, Xiping
AU - Jeon, Gwanggil
AU - Tao, Dacheng
AU - Zhou, Mengchu
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2022/8/1
Y1 - 2022/8/1
N2 - Visual relationship detection (VRD) is one newly developed computer vision task, aiming to recognize relations or interactions between objects in an image. It is a further learning task after object recognition, and is important for fully understanding images even the visual world. It has numerous applications, such as image retrieval, machine vision in robotics, visual question answer (VQA), and visual reasoning. However, this problem is difficult since relationships are not definite, and the number of possible relations is much larger than objects. So the complete annotation for visual relationships is much more difficult, making this task hard to learn. Many approaches have been proposed to tackle this problem especially with the development of deep neural networks in recent years. In this survey, we first introduce the background of visual relations. Then, we present categorization and frameworks of deep learning models for visual relationship detection. The high-level applications, benchmark datasets, as well as empirical analysis are also introduced for comprehensive understanding of this task.
AB - Visual relationship detection (VRD) is one newly developed computer vision task, aiming to recognize relations or interactions between objects in an image. It is a further learning task after object recognition, and is important for fully understanding images even the visual world. It has numerous applications, such as image retrieval, machine vision in robotics, visual question answer (VQA), and visual reasoning. However, this problem is difficult since relationships are not definite, and the number of possible relations is much larger than objects. So the complete annotation for visual relationships is much more difficult, making this task hard to learn. Many approaches have been proposed to tackle this problem especially with the development of deep neural networks in recent years. In this survey, we first introduce the background of visual relations. Then, we present categorization and frameworks of deep learning models for visual relationship detection. The high-level applications, benchmark datasets, as well as empirical analysis are also introduced for comprehensive understanding of this task.
KW - Deep learning
KW - detection
KW - neural networks
KW - visual relation
UR - http://www.scopus.com/inward/record.url?scp=85123781319&partnerID=8YFLogxK
U2 - 10.1109/TCYB.2022.3142013
DO - 10.1109/TCYB.2022.3142013
M3 - Article
C2 - 35077387
AN - SCOPUS:85123781319
SN - 2168-2267
VL - 52
SP - 8453
EP - 8466
JO - IEEE Transactions on Cybernetics
JF - IEEE Transactions on Cybernetics
IS - 8
ER -