Picture News Collection: A Dataset for Automatic Picture News Thumbnail Selection

  • Beijing Institute of Technology
  • Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Picture news has become increasingly popular among online news in recent years. As the first impression presented to viewers, the thumbnail plays a very important role in picture news. However, manually selecting thumbnails for a huge amount of picture news is time-consuming. In this paper, we introduce a new task, automatic picture news thumbnail selection: given a piece of picture news containing a set of images, select several appropriate images from it as candidate thumbnails. To this end, we present a large, publicly available image dataset for this task, called Picture News Collection (version 0.1 is publicly available online at https://github.com/anonymity01/Picture-News-Collection). The Picture News Collection contains more than 4 million images from 347,731 pieces of picture news collected from two well-known news websites, Sina News and NetEase News. Selecting good thumbnails is complicated and requires considering many aspects, such as attractiveness, hot topics, and content integrity. To select appropriate candidate thumbnails, we propose an attention-based thumbnail selection model; experimental results against three baselines based on image classification show that our proposed methods outperform the baselines. We introduce the automatic picture news thumbnail selection task and the dataset to encourage further study of this challenge.
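
The abstract states the task (pick several candidate thumbnails from the image set of one piece of picture news) but does not describe the model's architecture. The sketch below only illustrates that task formulation with a generic scaled dot-product attention score followed by top-k selection. It is not the authors' model: the function name `select_thumbnails`, the per-image feature vectors, and the `query` vector are all hypothetical, and in practice the features and query would come from a trained network.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_thumbnails(
    features: Sequence[Sequence[float]],
    query: Sequence[float],
    k: int = 3,
) -> List[int]:
    """Return the indices of the k images with the highest attention weight.

    `features` holds one (hypothetical) feature vector per image in a single
    piece of picture news; `query` is a (hypothetical) learned vector of the
    same length. Both are assumptions made for illustration only.
    """
    if not features:
        return []
    scale = math.sqrt(len(query))
    # Scaled dot product between each image feature and the query.
    scores = [
        sum(f * q for f, q in zip(feat, query)) / scale for feat in features
    ]
    weights = softmax(scores)
    # Rank images by attention weight, highest first, and keep the top k.
    ranked = sorted(range(len(features)), key=lambda i: weights[i], reverse=True)
    return ranked[:k]


# Toy example: four images with 2-dimensional features.
toy_features = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [-1.0, 0.0]]
toy_query = [1.0, 0.0]
print(select_thumbnails(toy_features, toy_query, k=2))  # prints [0, 2]
```

Returning several indices rather than a single best image mirrors the task as stated, where the output is a set of candidate thumbnails.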

Original language: English
Title of host publication: Web Information Systems Engineering – WISE 2019 - 20th International Conference, Proceedings
Editors: Reynold Cheng, Nikos Mamoulis, Yizhou Sun, Xin Huang
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 458-472
Number of pages: 15
ISBN (Print): 9783030342227
Publication status: Published - 2019
Externally published: Yes
Event: 20th International Conference on Web Information Systems Engineering, WISE 2019 - Hong Kong, China
Duration: 19 Jan 2020 to 22 Jan 2020

Publication series

Name: Lecture Notes in Computer Science
Volume: 11881 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 20th International Conference on Web Information Systems Engineering, WISE 2019
Country/Territory: China
City: Hong Kong
Period: 19/01/20 to 22/01/20

Keywords

  • Automatic picture news thumbnail selection
  • Image selection
  • Picture News Collection
