A fast hybrid model for large-scale zero-shot image recognition based on knowledge graphs

Bo Xiao, Yujiao Du, Q. M. Jonathan Wu, Qianfang Xu, Liping Yan

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Zero-shot learning aims to recognize unseen categories by learning an embedding space between data samples and semantic representations. For large-scale datasets with thousands of categories, embedding vectors of category labels are often used as the semantic representation, since it is difficult to define the semantic attributes of categories manually. To address the underutilization of prior knowledge when these embedding vectors are constructed, this paper first constructs a novel knowledge graph as a supplement to the basic WordNet graph, and then proposes a fast hybrid model, ARGCN-DKG (Attention-based Residual Graph Convolutional Network on Different types of Knowledge Graphs). By introducing a residual mechanism and an attention mechanism, and by integrating different knowledge graphs, the model improves the accuracy of knowledge transfer between categories. The model uses only a 2-layer GCN, pretrained image features, and category semantic features, so training can be completed in minutes on a single GPU, which could make it one of the fastest models to train for large-scale image recognition. Experimental results demonstrate that ARGCN-DKG achieves better results on large-scale datasets than the state-of-the-art model.
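
The abstract names the ingredients (a 2-layer GCN, a residual connection, attention, and several knowledge graphs) but not how they are wired together. The sketch below is one plausible arrangement, not the authors' implementation. It assumes symmetric adjacency normalization with self-loops, one scalar attention score per graph, weights shared across graphs, and a skip connection from the first layer's output to the second. Every function name and the toy data are hypothetical.

```python
import math

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def normalize_adjacency(adj):
    """Assumed normalization: D^-1/2 (A + I) D^-1/2, with self-loops added."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def relu(m):
    return [[max(0.0, v) for v in row] for row in m]

def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_fuse(outputs, scores):
    """Weighted sum of per-graph outputs; one scalar score per graph (an assumption)."""
    weights = softmax(scores)
    rows, cols = len(outputs[0]), len(outputs[0][0])
    return [[sum(w * out[i][j] for w, out in zip(weights, outputs)) for j in range(cols)]
            for i in range(rows)]

def forward(graphs, x, w1, w2, scores):
    """Two graph-convolution layers over several graphs, fused by attention.

    The residual adds the first layer's output to the second layer's output,
    so w2 must be square (hidden size equals output size).
    """
    norms = [normalize_adjacency(g) for g in graphs]
    h1 = relu(attention_fuse([matmul(matmul(a, x), w1) for a in norms], scores))
    h2 = attention_fuse([matmul(matmul(a, h1), w2) for a in norms], scores)
    return [[p + q for p, q in zip(r2, r1)] for r2, r1 in zip(h2, h1)]

# Toy example: 3 category nodes, two hypothetical graphs over the same nodes.
hierarchy_graph = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]   # e.g. WordNet-style parent/child links
supplement_graph = [[0, 0, 1], [0, 0, 0], [1, 0, 0]]  # e.g. an extra relation between categories
word_embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
predicted = forward([hierarchy_graph, supplement_graph], word_embeddings,
                    identity, identity, scores=[0.0, 0.0])
```

In a trained model the weight matrices and attention scores would be learned, with the per-category outputs regressed onto classifier weights derived from the pretrained image features. Here they are fixed only to keep the sketch short.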

Original language: English
Article number: 2935175
Pages (from-to): 119309-119318
Number of pages: 10
Journal: IEEE Access
Volume: 7
Publication status: Published - 2019

Keywords

  • Attention model
  • Graph convolutional network
  • Residual network
  • WordNet
  • Zero-shot learning
