跳到主要导航 跳到搜索 跳到主要内容

Generating image description by modeling spatial context of an image

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Generating the descriptive sentences of a real image is a challenging task in image understanding. The difficulty mainly lies in recognizing the interaction activities between objects, and predicting the relationship between objects and stuff/scene. In this paper, we propose a framework for improving image description generation by addressing the above problems. Our framework mainly includes two models: a unified spatial context model and an image description generation model. The former, as the centerpiece of our framework, models 3D spatial context to learn the human-object interaction activities and predict the semantic relationship between these activities and stuff/scene. The spatial context model casts the problems as latent structured labeling problems, and can be resolved by a unified mathematical optimization. Then based on the semantic relationship, the image description generation model generates image descriptive sentences through the proposed lexicalized tree-based algorithm. Experiments on a joint dataset show that our framework outperforms state-of-the-art methods in spatial co-occurrence context analysis, the human-object interaction recognition, and the image description generation.

源语言英语
主期刊名2015 International Joint Conference on Neural Networks, IJCNN 2015
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9781479919604, 9781479919604, 9781479919604, 9781479919604
DOI
出版状态已出版 - 28 9月 2015
活动International Joint Conference on Neural Networks, IJCNN 2015 - Killarney, 爱尔兰
期限: 12 7月 201517 7月 2015

出版系列

姓名Proceedings of the International Joint Conference on Neural Networks
2015-September

会议

会议International Joint Conference on Neural Networks, IJCNN 2015
国家/地区爱尔兰
Killarney
时期12/07/1517/07/15

指纹

探究 'Generating image description by modeling spatial context of an image' 的科研主题。它们共同构成独一无二的指纹。

引用此