Unsupervised Style Control for Image Captioning

Junyu Tian, Zhikun Yang, Shumin Shi*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

We propose a novel unsupervised image captioning method. Image captioning involves two fields of deep learning, natural language processing and computer vision. The excessive pursuit of model evaluation results makes the caption style generated by the model too monotonous, which is difficult to meet people’s demands for vivid and stylized image captions. Therefore, we propose an image captioning model that combines text style transfer and image emotion recognition methods, with which the model can better understand images and generate controllable stylized captions. The proposed method can automatically judge the emotion contained in the image through the image emotion recognition module, better understand the image content, and control the description through the text style transfer method, thereby generating captions that meet people’s expectations. To our knowledge, this is the first work to use both image emotion recognition and text style control.

Original languageEnglish
Title of host publicationData Science - 8th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2022, Proceedings
EditorsYang Wang, Guobin Zhu, Qilong Han, Hongzhi Wang, Xianhua Song, Zeguang Lu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages413-424
Number of pages12
ISBN (Print)9789811951930
DOIs
Publication statusPublished - 2022
Event8th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2022 - Chengdu, China
Duration: 19 Aug 202222 Aug 2022

Publication series

NameCommunications in Computer and Information Science
Volume1628 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference8th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2022
Country/TerritoryChina
CityChengdu
Period19/08/2222/08/22

Keywords

  • Image caption
  • Image sentiment recognization
  • Text style transfer

Fingerprint

Dive into the research topics of 'Unsupervised Style Control for Image Captioning'. Together they form a unique fingerprint.

Cite this