Image captioning with relational knowledge

Huan Yang; Dandan Song; Lejian Liao

doi:10.1007/978-3-319-97310-4_43

Image captioning with relational knowledge

Huan Yang, Dandan Song^*, Lejian Liao

^*Corresponding author for this work

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Citations (Scopus)

Abstract

People have learned extensive relational knowledge from daily life. This is one of the facts that enables human to describe the information from images easily. In this paper, we propose a novel framework called Image Captioning with Relational Knowledge (ICRK) that combines relational knowledge with image captioning model and utilizes relational knowledge to strengthen the learning process of representing words. As more precise syntactic and semantic word relationships were learned, the image captioning model acquires more semantic features that help to generate more accurate image descriptions. Experiments on several benchmark datasets, using automatic evaluation metrics, have all demonstrated that our model can significantly improve the quality of image captioning.

Original language	English
Title of host publication	PRICAI 2018
Subtitle of host publication	Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings
Editors	Xin Geng, Byeong-Ho Kang
Publisher	Springer Verlag
Pages	378-386
Number of pages	9
ISBN (Print)	9783319973098
DOIs	https://doi.org/10.1007/978-3-319-97310-4_43
Publication status	Published - 2018
Event	15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018 - Nanjing, China Duration: 28 Aug 2018 → 31 Aug 2018

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	11013 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018
Country/Territory	China
City	Nanjing
Period	28/08/18 → 31/08/18

Keywords

Image captioning
Relational knowledge
Word embedding

Access to Document

10.1007/978-3-319-97310-4_43

Cite this

Yang, H., Song, D., & Liao, L. (2018). Image captioning with relational knowledge. In X. Geng, & B.-H. Kang (Eds.), PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings (pp. 378-386). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11013 LNAI). Springer Verlag. https://doi.org/10.1007/978-3-319-97310-4_43

Yang, Huan ; Song, Dandan ; Liao, Lejian. / Image captioning with relational knowledge. PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings. editor / Xin Geng ; Byeong-Ho Kang. Springer Verlag, 2018. pp. 378-386 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{ae69043b3c284a5fa16a5609a47e153c,

title = "Image captioning with relational knowledge",

abstract = "People have learned extensive relational knowledge from daily life. This is one of the facts that enables human to describe the information from images easily. In this paper, we propose a novel framework called Image Captioning with Relational Knowledge (ICRK) that combines relational knowledge with image captioning model and utilizes relational knowledge to strengthen the learning process of representing words. As more precise syntactic and semantic word relationships were learned, the image captioning model acquires more semantic features that help to generate more accurate image descriptions. Experiments on several benchmark datasets, using automatic evaluation metrics, have all demonstrated that our model can significantly improve the quality of image captioning.",

keywords = "Image captioning, Relational knowledge, Word embedding",

author = "Huan Yang and Dandan Song and Lejian Liao",

note = "Publisher Copyright: {\textcopyright} Springer International Publishing AG, part of Springer Nature 2018.; 15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018 ; Conference date: 28-08-2018 Through 31-08-2018",

year = "2018",

doi = "10.1007/978-3-319-97310-4_43",

language = "English",

isbn = "9783319973098",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "378--386",

editor = "Xin Geng and Byeong-Ho Kang",

booktitle = "PRICAI 2018",

address = "Germany",

}

Yang, H, Song, D & Liao, L 2018, Image captioning with relational knowledge. in X Geng & B-H Kang (eds), PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11013 LNAI, Springer Verlag, pp. 378-386, 15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018, Nanjing, China, 28/08/18. https://doi.org/10.1007/978-3-319-97310-4_43

Image captioning with relational knowledge. / Yang, Huan; Song, Dandan; Liao, Lejian.
PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings. ed. / Xin Geng; Byeong-Ho Kang. Springer Verlag, 2018. p. 378-386 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11013 LNAI).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Image captioning with relational knowledge

AU - Yang, Huan

AU - Song, Dandan

AU - Liao, Lejian

N1 - Publisher Copyright: © Springer International Publishing AG, part of Springer Nature 2018.

PY - 2018

Y1 - 2018

N2 - People have learned extensive relational knowledge from daily life. This is one of the facts that enables human to describe the information from images easily. In this paper, we propose a novel framework called Image Captioning with Relational Knowledge (ICRK) that combines relational knowledge with image captioning model and utilizes relational knowledge to strengthen the learning process of representing words. As more precise syntactic and semantic word relationships were learned, the image captioning model acquires more semantic features that help to generate more accurate image descriptions. Experiments on several benchmark datasets, using automatic evaluation metrics, have all demonstrated that our model can significantly improve the quality of image captioning.

AB - People have learned extensive relational knowledge from daily life. This is one of the facts that enables human to describe the information from images easily. In this paper, we propose a novel framework called Image Captioning with Relational Knowledge (ICRK) that combines relational knowledge with image captioning model and utilizes relational knowledge to strengthen the learning process of representing words. As more precise syntactic and semantic word relationships were learned, the image captioning model acquires more semantic features that help to generate more accurate image descriptions. Experiments on several benchmark datasets, using automatic evaluation metrics, have all demonstrated that our model can significantly improve the quality of image captioning.

KW - Image captioning

KW - Relational knowledge

KW - Word embedding

UR - http://www.scopus.com/inward/record.url?scp=85051987410&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-97310-4_43

DO - 10.1007/978-3-319-97310-4_43

M3 - Conference contribution

AN - SCOPUS:85051987410

SN - 9783319973098

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 378

EP - 386

BT - PRICAI 2018

A2 - Geng, Xin

A2 - Kang, Byeong-Ho

PB - Springer Verlag

T2 - 15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018

Y2 - 28 August 2018 through 31 August 2018

ER -

Yang H, Song D, Liao L. Image captioning with relational knowledge. In Geng X, Kang BH, editors, PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings. Springer Verlag. 2018. p. 378-386. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-97310-4_43

Image captioning with relational knowledge

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this