Learning from label proportions on high-dimensional data

Yong Shi, Jiabin Liu, Zhiquan Qi*, Bo Wang

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

19 Citations (Scopus)

Abstract

Learning from label proportions (LLP), in which the training data is organized into bags and only the proportion of each class in each bag is available, has attracted wide interest in machine learning. However, solving high-dimensional LLP problems remains challenging. In this paper, we propose a novel algorithm called learning from label proportions based on random forests (LLP-RF), which is well suited to high-dimensional LLP problems. First, by defining the hidden class labels inside target bags as random variables, we formulate a robust loss function based on random forests and incorporate the proportion information into LLP-RF by penalizing the difference between the ground-truth and estimated label proportions. Second, a simple but efficient alternating annealing method is employed to solve the resulting optimization model. Finally, various experiments demonstrate that our algorithm achieves the best accuracy on high-dimensional data compared with several recently developed methods.
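
The proportion penalty described in the abstract can be illustrated with a minimal sketch. This is not the LLP-RF loss itself, whose exact form is not given here; it assumes binary labels in {0, 1} and uses an absolute difference as the penalty. The function name `proportion_penalty` and the toy bags are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def proportion_penalty(bag_ids, predicted_labels, given_proportions):
    """Sum over bags of |estimated positive proportion - given proportion|.

    Assumptions (not from the paper): labels are binary (0 or 1) and the
    penalty is the absolute difference between proportions.
    """
    counts = defaultdict(int)      # number of instances per bag
    positives = defaultdict(int)   # number of predicted positives per bag
    for bag, label in zip(bag_ids, predicted_labels):
        counts[bag] += 1
        if label == 1:
            positives[bag] += 1
    return sum(
        abs(positives[bag] / counts[bag] - given_proportions[bag])
        for bag in counts
    )


# Toy data: two bags with known positive-class proportions.
bag_ids = ["a", "a", "a", "a", "b", "b"]
predicted = [1, 1, 0, 0, 1, 0]
given = {"a": 0.5, "b": 1.0}

penalty = proportion_penalty(bag_ids, predicted, given)
print(penalty)
```

In this toy case, bag "a" matches its given proportion exactly, while bag "b" has one of two instances predicted positive against a given proportion of 1.0, so only bag "b" contributes to the penalty. An alternating scheme of the kind the abstract mentions would repeatedly update the hidden labels and the classifier to reduce such a penalty alongside the classification loss.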

Original language: English
Pages (from-to): 9-18
Number of pages: 10
Journal: Neural Networks
Volume: 103
Publication status: Published - Jul 2018
Externally published
