Deep unsupervised active learning via matrix sketching

Changsheng Li*, Rongqing Li, Ye Yuan, Guoren Wang, Dong Xu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

8 Citations (Scopus)

Abstract

—Most existing unsupervised active learning methods aim at minimizing the data reconstruction loss by using the linear models to choose representative samples for manually labeling in an unsupervised setting. Thus these methods often fail in modelling data with complex non-linear structure. To address this issue, we propose a new deep unsupervised Active Learning method for classification tasks, inspired by the idea of Matrix Sketching, called ALMS. Specifically, ALMS leverages a deep auto-encoder to embed data into a latent space, and then describes all the embedded data with a small size sketch to summarize the major characteristics of the data. In contrast to previous approaches that reconstruct the whole data matrix for selecting the representative samples, ALMS aims to select a representative subset of samples to well approximate the sketch, which can preserve the major information of data meanwhile significantly reducing the number of network parameters. This makes our algorithm alleviate the issue of model overfitting and readily cope with large datasets. Actually, the sketch provides a type of self-supervised signal to guide the learning of the model. Moreover, we propose to construct an auxiliary self-supervised task by classifying real/fake samples, in order to further improve the representation ability of the encoder. We thoroughly evaluate the performance of ALMS on both single-label and multi-label classification tasks, and the results demonstrate its superior performance against the state-of-the-art methods. The code can be found at https://github.com/lrq99/ALMS.

Original languageEnglish
Pages (from-to)9280-9293
Number of pages14
JournalIEEE Transactions on Image Processing
Volume30
DOIs
Publication statusPublished - 2021

Keywords

  • Data reconstruction
  • Matrix sketching
  • Self-supervised learning
  • Unsupervised active learning

Fingerprint

Dive into the research topics of 'Deep unsupervised active learning via matrix sketching'. Together they form a unique fingerprint.

Cite this