Recognizing visual composite in real images

Lin Bai; Kan Li; Shuai Jiang

doi:10.1109/IJCNN.2015.7280523

Recognizing visual composite in real images

Lin Bai, Kan Li, Shuai Jiang

计算机学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

1 引用（Scopus）

摘要

Automatically discovering and recognizing the main structured visual pattern of an image is a challenging problem. The most difficulties are how to find the component objects and how to recognize the interaction among these objects. The component objects of the structured visual pattern have consistent 3D spatial co-occurrence layout across images, which manifest themselves as a predictable pattern called visual composite. In this paper, we propose a visual composite recognition model to automatically discover and recognize the visual composite of an image. Our model firstly learns 3D spatial co-occurrence statistics among objects to discover the potential structured visual pattern of an image so that it captures the component objects of visual composite. Secondly, we construct a feedforward architecture using the proposed factored three-way interaction machine to recognize the visual composite, which casts the recognition problem as a structured prediction task. It predicts the visual composite by maximizing the probability of the correct structured label given the component objects and their 3D spatial context. Experiments conducted on a six-class sports dataset and a phrasal recognition dataset respectively demonstrate the encouraging performance of our model in discovery precision and recognition accuracy compared with competing approaches.

源语言	英语
主期刊名	2015 International Joint Conference on Neural Networks, IJCNN 2015
出版商	Institute of Electrical and Electronics Engineers Inc.
ISBN（电子版）	9781479919604, 9781479919604, 9781479919604, 9781479919604
DOI	https://doi.org/10.1109/IJCNN.2015.7280523
出版状态	已出版 - 28 9月 2015
活动	International Joint Conference on Neural Networks, IJCNN 2015 - Killarney, 爱尔兰期限: 12 7月 2015 → 17 7月 2015

出版系列

姓名	Proceedings of the International Joint Conference on Neural Networks
卷	2015-September

会议

会议	International Joint Conference on Neural Networks, IJCNN 2015
国家/地区	爱尔兰
市	Killarney
时期	12/07/15 → 17/07/15

访问文件

10.1109/IJCNN.2015.7280523

其它文件与链接

链接到 Scopus 的出版物

引用此

Bai, L., Li, K., & Jiang, S. (2015). Recognizing visual composite in real images. 在 2015 International Joint Conference on Neural Networks, IJCNN 2015 文章 7280523 (Proceedings of the International Joint Conference on Neural Networks; 卷 2015-September). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IJCNN.2015.7280523

@inproceedings{03d8ac1bf49d4586ad1672e1f8649369,

title = "Recognizing visual composite in real images",

abstract = "Automatically discovering and recognizing the main structured visual pattern of an image is a challenging problem. The most difficulties are how to find the component objects and how to recognize the interaction among these objects. The component objects of the structured visual pattern have consistent 3D spatial co-occurrence layout across images, which manifest themselves as a predictable pattern called visual composite. In this paper, we propose a visual composite recognition model to automatically discover and recognize the visual composite of an image. Our model firstly learns 3D spatial co-occurrence statistics among objects to discover the potential structured visual pattern of an image so that it captures the component objects of visual composite. Secondly, we construct a feedforward architecture using the proposed factored three-way interaction machine to recognize the visual composite, which casts the recognition problem as a structured prediction task. It predicts the visual composite by maximizing the probability of the correct structured label given the component objects and their 3D spatial context. Experiments conducted on a six-class sports dataset and a phrasal recognition dataset respectively demonstrate the encouraging performance of our model in discovery precision and recognition accuracy compared with competing approaches.",

keywords = "Computational modeling, Image recognition, Three-dimensional displays, Visualization",

author = "Lin Bai and Kan Li and Shuai Jiang",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.; International Joint Conference on Neural Networks, IJCNN 2015 ; Conference date: 12-07-2015 Through 17-07-2015",

year = "2015",

month = sep,

day = "28",

doi = "10.1109/IJCNN.2015.7280523",

language = "English",

series = "Proceedings of the International Joint Conference on Neural Networks",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2015 International Joint Conference on Neural Networks, IJCNN 2015",

address = "United States",

}

Bai, L, Li, K & Jiang, S 2015, Recognizing visual composite in real images. 在 2015 International Joint Conference on Neural Networks, IJCNN 2015., 7280523, Proceedings of the International Joint Conference on Neural Networks, 卷 2015-September, Institute of Electrical and Electronics Engineers Inc., International Joint Conference on Neural Networks, IJCNN 2015, Killarney, 爱尔兰, 12/07/15. https://doi.org/10.1109/IJCNN.2015.7280523

Recognizing visual composite in real images. / Bai, Lin; Li, Kan; Jiang, Shuai.
2015 International Joint Conference on Neural Networks, IJCNN 2015. Institute of Electrical and Electronics Engineers Inc., 2015. 7280523 (Proceedings of the International Joint Conference on Neural Networks; 卷 2015-September).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Recognizing visual composite in real images

AU - Bai, Lin

AU - Li, Kan

AU - Jiang, Shuai

PY - 2015/9/28

Y1 - 2015/9/28

N2 - Automatically discovering and recognizing the main structured visual pattern of an image is a challenging problem. The most difficulties are how to find the component objects and how to recognize the interaction among these objects. The component objects of the structured visual pattern have consistent 3D spatial co-occurrence layout across images, which manifest themselves as a predictable pattern called visual composite. In this paper, we propose a visual composite recognition model to automatically discover and recognize the visual composite of an image. Our model firstly learns 3D spatial co-occurrence statistics among objects to discover the potential structured visual pattern of an image so that it captures the component objects of visual composite. Secondly, we construct a feedforward architecture using the proposed factored three-way interaction machine to recognize the visual composite, which casts the recognition problem as a structured prediction task. It predicts the visual composite by maximizing the probability of the correct structured label given the component objects and their 3D spatial context. Experiments conducted on a six-class sports dataset and a phrasal recognition dataset respectively demonstrate the encouraging performance of our model in discovery precision and recognition accuracy compared with competing approaches.

AB - Automatically discovering and recognizing the main structured visual pattern of an image is a challenging problem. The most difficulties are how to find the component objects and how to recognize the interaction among these objects. The component objects of the structured visual pattern have consistent 3D spatial co-occurrence layout across images, which manifest themselves as a predictable pattern called visual composite. In this paper, we propose a visual composite recognition model to automatically discover and recognize the visual composite of an image. Our model firstly learns 3D spatial co-occurrence statistics among objects to discover the potential structured visual pattern of an image so that it captures the component objects of visual composite. Secondly, we construct a feedforward architecture using the proposed factored three-way interaction machine to recognize the visual composite, which casts the recognition problem as a structured prediction task. It predicts the visual composite by maximizing the probability of the correct structured label given the component objects and their 3D spatial context. Experiments conducted on a six-class sports dataset and a phrasal recognition dataset respectively demonstrate the encouraging performance of our model in discovery precision and recognition accuracy compared with competing approaches.

KW - Computational modeling

KW - Image recognition

KW - Three-dimensional displays

KW - Visualization

UR - http://www.scopus.com/inward/record.url?scp=84951028520&partnerID=8YFLogxK

U2 - 10.1109/IJCNN.2015.7280523

DO - 10.1109/IJCNN.2015.7280523

M3 - Conference contribution

AN - SCOPUS:84951028520

T3 - Proceedings of the International Joint Conference on Neural Networks

BT - 2015 International Joint Conference on Neural Networks, IJCNN 2015

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - International Joint Conference on Neural Networks, IJCNN 2015

Y2 - 12 July 2015 through 17 July 2015

ER -

Recognizing visual composite in real images

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此