CRAT-XML: Contrastive Representation Adversarial Training for Extremely Multi-Label Text Classification

Jie Zhu*, Heyan Huang, Xian-Ling Mao

*Corresponding author for this work

Research output: Contribution to journal › Conference article › Peer-reviewed

Abstract

Extreme multi-label text classification (XMC) is an important and challenging problem in this age of widespread internet access, with a wide range of applications such as web tagging, legal document annotation, and commodity classification. Most existing state-of-the-art methods are based on deep learning; however, two problems remain: 1) the vector representations generated by most pre-trained models suffer from anisotropy and uneven distribution, which significantly degrades performance on the XMC task; 2) existing models are large and rely on ensembles of many sub-models, some taking hundreds of hours to train, which seriously reduces experimental efficiency. Therefore, in this paper, we propose CRAT-XML, which uses contrastive adversarial learning to optimize text representations and to better capture the dependencies between texts and labels, thus reducing the need for ensembling at the representation level and achieving relatively high accuracy under low-resource and low-time conditions. Experimental results demonstrate that our model achieves state-of-the-art results with a single model while greatly reducing training time and model size.
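The abstract names two ingredients without detailing them: a contrastive objective on text representations and an adversarial training step. The sketch below is not the paper's actual method, only an illustrative combination of the two under stated assumptions: an InfoNCE-style contrastive loss over one positive and several negatives (dot-product similarity, assumed temperature), and an FGM-style perturbation that nudges the anchor embedding along the loss gradient. All names, dimensions, and constants are hypothetical.

```python
import numpy as np

def info_nce(anchor, positive, negatives, tau=0.5):
    """InfoNCE-style loss: log-sum-exp over candidates minus the
    positive's logit, with dot-product similarity (an assumption)."""
    cands = [positive] + list(negatives)
    logits = np.array([anchor @ c for c in cands]) / tau
    return np.log(np.exp(logits).sum()) - logits[0]

def info_nce_grad(anchor, positive, negatives, tau=0.5):
    """Analytic gradient of the loss w.r.t. the anchor embedding:
    softmax-weighted sum of candidates minus the positive, over tau."""
    cands = [positive] + list(negatives)
    logits = np.array([anchor @ c for c in cands]) / tau
    p = np.exp(logits - logits.max())
    p /= p.sum()  # softmax over the positive and the negatives
    return (sum(pj * c for pj, c in zip(p, cands)) - positive) / tau

rng = np.random.default_rng(0)
anchor = rng.normal(size=8)
positive = anchor + 0.05 * rng.normal(size=8)   # near-duplicate "view"
negatives = [rng.normal(size=8) for _ in range(4)]

clean = info_nce(anchor, positive, negatives)

# FGM-style adversarial step: a small move along the normalized
# gradient, i.e. the direction that locally increases the loss.
g = info_nce_grad(anchor, positive, negatives)
adv_anchor = anchor + 0.05 * g / (np.linalg.norm(g) + 1e-12)
adv = info_nce(adv_anchor, positive, negatives)

print(adv > clean)  # the perturbation raises the contrastive loss
```

Training the encoder on both the clean and the perturbed representations is the usual way such a perturbation acts as a regularizer, which is consistent with the abstract's claim of improving the representation space without ensembling.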

Original language: English
Article number: 012002
Journal: Journal of Physics: Conference Series
Volume: 2513
Issue number: 1
DOIs
Publication status: Published - 2023
Event: 2023 7th International Conference on Artificial Intelligence, Automation and Control Technologies, AIACT 2023 - Virtual, Online, China
Duration: 24 Feb 2023 - 26 Feb 2023
