A fast hybrid model for large-scale zero-shot image recognition based on knowledge graphs

Bo Xiao; Yujiao Du; Q. M.Jonathan Wu; Qianfang Xu; Liping Yan

doi:10.1109/ACCESS.2019.2935175

A fast hybrid model for large-scale zero-shot image recognition based on knowledge graphs

Bo Xiao, Yujiao Du, Q. M.Jonathan Wu, Qianfang Xu, Liping Yan

自动化学院

科研成果: 期刊稿件 › 文章 › 同行评审

2 引用（Scopus）

摘要

Zero-shot learning aims to recognize unseen categories by learning an embedding space between data samples and semantic representations. For the large-scale datasets with thousands of categories, embedding vectors of category labels are often used for semantic representation since it is difficult to define the semantic attributes of categories manually. Facing the problem of underutilization of prior knowledge during the construction of embedding vectors, this paper first constructs a novel knowledge graph as the supplement to the basicWordNet graph, and then proposes a fast hybrid model ARGCN-DKG, which means Attention based Residual Graph Convolutional Network on Different types of Knowledge Graphs. By introducing residual mechanism and attention mechanism, and integrating different knowledge graphs, the accuracy of knowledge transfer between different categories can be improved. Our model only use 2-layer GCN, the pretrained image features and category semantic features, so the training process could be done in minitues on single GPU, which could be one of the fastest training models for large-scale image recognition. Experiment results demonstrate that ARGCN-DKG model could get better results for large-scale datasets than the state-of-the-art model.

源语言	英语
文章编号	2935175
页（从-至）	119309-119318
页数	10
期刊	IEEE Access
卷	7
DOI	https://doi.org/10.1109/ACCESS.2019.2935175
出版状态	已出版 - 2019

访问文件

10.1109/ACCESS.2019.2935175

其它文件与链接

链接到 Scopus 的出版物

引用此

Xiao, B., Du, Y., Wu, Q. M. J., Xu, Q., & Yan, L. (2019). A fast hybrid model for large-scale zero-shot image recognition based on knowledge graphs. IEEE Access, 7, 119309-119318. 文章 2935175. https://doi.org/10.1109/ACCESS.2019.2935175

@article{c010585224fe4388a89c70560a278133,

title = "A fast hybrid model for large-scale zero-shot image recognition based on knowledge graphs",

abstract = "Zero-shot learning aims to recognize unseen categories by learning an embedding space between data samples and semantic representations. For the large-scale datasets with thousands of categories, embedding vectors of category labels are often used for semantic representation since it is difficult to define the semantic attributes of categories manually. Facing the problem of underutilization of prior knowledge during the construction of embedding vectors, this paper first constructs a novel knowledge graph as the supplement to the basicWordNet graph, and then proposes a fast hybrid model ARGCN-DKG, which means Attention based Residual Graph Convolutional Network on Different types of Knowledge Graphs. By introducing residual mechanism and attention mechanism, and integrating different knowledge graphs, the accuracy of knowledge transfer between different categories can be improved. Our model only use 2-layer GCN, the pretrained image features and category semantic features, so the training process could be done in minitues on single GPU, which could be one of the fastest training models for large-scale image recognition. Experiment results demonstrate that ARGCN-DKG model could get better results for large-scale datasets than the state-of-the-art model.",

keywords = "Attention model, Graph convolutional network, Residual network, WordNet, Zero-shot learning",

author = "Bo Xiao and Yujiao Du and Wu, {Q. M.Jonathan} and Qianfang Xu and Liping Yan",

year = "2019",

doi = "10.1109/ACCESS.2019.2935175",

language = "English",

volume = "7",

pages = "119309--119318",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - A fast hybrid model for large-scale zero-shot image recognition based on knowledge graphs

AU - Xiao, Bo

AU - Du, Yujiao

AU - Wu, Q. M.Jonathan

AU - Xu, Qianfang

AU - Yan, Liping

PY - 2019

Y1 - 2019

N2 - Zero-shot learning aims to recognize unseen categories by learning an embedding space between data samples and semantic representations. For the large-scale datasets with thousands of categories, embedding vectors of category labels are often used for semantic representation since it is difficult to define the semantic attributes of categories manually. Facing the problem of underutilization of prior knowledge during the construction of embedding vectors, this paper first constructs a novel knowledge graph as the supplement to the basicWordNet graph, and then proposes a fast hybrid model ARGCN-DKG, which means Attention based Residual Graph Convolutional Network on Different types of Knowledge Graphs. By introducing residual mechanism and attention mechanism, and integrating different knowledge graphs, the accuracy of knowledge transfer between different categories can be improved. Our model only use 2-layer GCN, the pretrained image features and category semantic features, so the training process could be done in minitues on single GPU, which could be one of the fastest training models for large-scale image recognition. Experiment results demonstrate that ARGCN-DKG model could get better results for large-scale datasets than the state-of-the-art model.

AB - Zero-shot learning aims to recognize unseen categories by learning an embedding space between data samples and semantic representations. For the large-scale datasets with thousands of categories, embedding vectors of category labels are often used for semantic representation since it is difficult to define the semantic attributes of categories manually. Facing the problem of underutilization of prior knowledge during the construction of embedding vectors, this paper first constructs a novel knowledge graph as the supplement to the basicWordNet graph, and then proposes a fast hybrid model ARGCN-DKG, which means Attention based Residual Graph Convolutional Network on Different types of Knowledge Graphs. By introducing residual mechanism and attention mechanism, and integrating different knowledge graphs, the accuracy of knowledge transfer between different categories can be improved. Our model only use 2-layer GCN, the pretrained image features and category semantic features, so the training process could be done in minitues on single GPU, which could be one of the fastest training models for large-scale image recognition. Experiment results demonstrate that ARGCN-DKG model could get better results for large-scale datasets than the state-of-the-art model.

KW - Attention model

KW - Graph convolutional network

KW - Residual network

KW - WordNet

KW - Zero-shot learning

UR - http://www.scopus.com/inward/record.url?scp=85097332022&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2019.2935175

DO - 10.1109/ACCESS.2019.2935175

M3 - Article

AN - SCOPUS:85097332022

SN - 2169-3536

VL - 7

SP - 119309

EP - 119318

JO - IEEE Access

JF - IEEE Access

M1 - 2935175

ER -

A fast hybrid model for large-scale zero-shot image recognition based on knowledge graphs

摘要

访问文件

其它文件与链接

指纹

引用此