Multi-label learning with missing and completely unobserved labels

Jun Huang; Linchuan Xu; Kun Qian; Jing Wang; Kenji Yamanishi

doi:10.1007/s10618-021-00743-x

Multi-label learning with missing and completely unobserved labels

Jun Huang^*, Linchuan Xu, Kun Qian, Jing Wang, Kenji Yamanishi

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

22 Citations (Scopus)

Abstract

Multi-label learning deals with data examples which are associated with multiple class labels simultaneously. Despite the success of existing approaches to multi-label learning, there is still a problem neglected by researchers, i.e., not only are some of the values of observed labels missing, but also some of the labels are completely unobserved for the training data. We refer to the problem as multi-label learning with missing and completely unobserved labels, and argue that it is necessary to discover these completely unobserved labels in order to mine useful knowledge and make a deeper understanding of what is behind the data. In this paper, we propose a new approach named MCUL to solve multi-label learning with Missing and Completely Unobserved Labels. We try to discover the unobserved labels of a multi-label data set with a clustering based regularization term and describe the semantic meanings of them based on the label-specific features learned by MCUL, and overcome the problem of missing labels by exploiting label correlations. The proposed method MCUL can predict both the observed and newly discovered labels simultaneously for unseen data examples. Experimental results validated over ten benchmark datasets demonstrate that the proposed method can outperform other state-of-the-art approaches on observed labels and obtain an acceptable performance on the new discovered labels as well.

Original language	English
Pages (from-to)	1061-1086
Number of pages	26
Journal	Data Mining and Knowledge Discovery
Volume	35
Issue number	3
DOIs	https://doi.org/10.1007/s10618-021-00743-x
Publication status	Published - May 2021
Externally published	Yes

Keywords

Completely unobserved labels
Discovering new labels
Missing labels
Multi-label learning
Unseen labels

Access to Document

10.1007/s10618-021-00743-x

Cite this

@article{216fd91cbfdc450db7ea9e29b871e9cc,

title = "Multi-label learning with missing and completely unobserved labels",

abstract = "Multi-label learning deals with data examples which are associated with multiple class labels simultaneously. Despite the success of existing approaches to multi-label learning, there is still a problem neglected by researchers, i.e., not only are some of the values of observed labels missing, but also some of the labels are completely unobserved for the training data. We refer to the problem as multi-label learning with missing and completely unobserved labels, and argue that it is necessary to discover these completely unobserved labels in order to mine useful knowledge and make a deeper understanding of what is behind the data. In this paper, we propose a new approach named MCUL to solve multi-label learning with Missing and Completely Unobserved Labels. We try to discover the unobserved labels of a multi-label data set with a clustering based regularization term and describe the semantic meanings of them based on the label-specific features learned by MCUL, and overcome the problem of missing labels by exploiting label correlations. The proposed method MCUL can predict both the observed and newly discovered labels simultaneously for unseen data examples. Experimental results validated over ten benchmark datasets demonstrate that the proposed method can outperform other state-of-the-art approaches on observed labels and obtain an acceptable performance on the new discovered labels as well.",

keywords = "Completely unobserved labels, Discovering new labels, Missing labels, Multi-label learning, Unseen labels",

author = "Jun Huang and Linchuan Xu and Kun Qian and Jing Wang and Kenji Yamanishi",

note = "Publisher Copyright: {\textcopyright} 2021, The Author(s).",

year = "2021",

month = may,

doi = "10.1007/s10618-021-00743-x",

language = "English",

volume = "35",

pages = "1061--1086",

journal = "Data Mining and Knowledge Discovery",

issn = "1384-5810",

publisher = "Springer Netherlands",

number = "3",

}

TY - JOUR

T1 - Multi-label learning with missing and completely unobserved labels

AU - Huang, Jun

AU - Xu, Linchuan

AU - Qian, Kun

AU - Wang, Jing

AU - Yamanishi, Kenji

PY - 2021/5

Y1 - 2021/5

N2 - Multi-label learning deals with data examples which are associated with multiple class labels simultaneously. Despite the success of existing approaches to multi-label learning, there is still a problem neglected by researchers, i.e., not only are some of the values of observed labels missing, but also some of the labels are completely unobserved for the training data. We refer to the problem as multi-label learning with missing and completely unobserved labels, and argue that it is necessary to discover these completely unobserved labels in order to mine useful knowledge and make a deeper understanding of what is behind the data. In this paper, we propose a new approach named MCUL to solve multi-label learning with Missing and Completely Unobserved Labels. We try to discover the unobserved labels of a multi-label data set with a clustering based regularization term and describe the semantic meanings of them based on the label-specific features learned by MCUL, and overcome the problem of missing labels by exploiting label correlations. The proposed method MCUL can predict both the observed and newly discovered labels simultaneously for unseen data examples. Experimental results validated over ten benchmark datasets demonstrate that the proposed method can outperform other state-of-the-art approaches on observed labels and obtain an acceptable performance on the new discovered labels as well.

AB - Multi-label learning deals with data examples which are associated with multiple class labels simultaneously. Despite the success of existing approaches to multi-label learning, there is still a problem neglected by researchers, i.e., not only are some of the values of observed labels missing, but also some of the labels are completely unobserved for the training data. We refer to the problem as multi-label learning with missing and completely unobserved labels, and argue that it is necessary to discover these completely unobserved labels in order to mine useful knowledge and make a deeper understanding of what is behind the data. In this paper, we propose a new approach named MCUL to solve multi-label learning with Missing and Completely Unobserved Labels. We try to discover the unobserved labels of a multi-label data set with a clustering based regularization term and describe the semantic meanings of them based on the label-specific features learned by MCUL, and overcome the problem of missing labels by exploiting label correlations. The proposed method MCUL can predict both the observed and newly discovered labels simultaneously for unseen data examples. Experimental results validated over ten benchmark datasets demonstrate that the proposed method can outperform other state-of-the-art approaches on observed labels and obtain an acceptable performance on the new discovered labels as well.

KW - Completely unobserved labels

KW - Discovering new labels

KW - Missing labels

KW - Multi-label learning

KW - Unseen labels

UR - http://www.scopus.com/inward/record.url?scp=85102558149&partnerID=8YFLogxK

U2 - 10.1007/s10618-021-00743-x

DO - 10.1007/s10618-021-00743-x

M3 - Article

AN - SCOPUS:85102558149

SN - 1384-5810

VL - 35

SP - 1061

EP - 1086

JO - Data Mining and Knowledge Discovery

JF - Data Mining and Knowledge Discovery

IS - 3

ER -

Multi-label learning with missing and completely unobserved labels

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this