BIT-Event at NLPCC-2021 Task 3: Subevent Identification via Adversarial Training

Xiao Liu, Ge Shi, Bo Wang, Changsen Yuan, Heyan Huang*, Chong Feng, Lifang Wu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

This paper describes the system proposed by the BIT-Event team for NLPCC 2021 shared task on Subevent Identification. The task includes two settings, and these settings face less reliable labeled data and the dilemma about selecting the most valid data to annotate, respectively. Without the luxury of training data, we propose a hybrid system based on semi-supervised algorithms to enhance the performance by effectively learning from a large amount of unlabeled corpus. In this hybrid model, we first fine-tune the pre-trained model to adapt it to the training data scenario. Besides, Adversarial Training and Virtual Adversarial Training are combined to enhance the effect of a single model with unlabeled in-domain data. The additional information is further captured via retraining using pseudo-labels. On the other hand, we apply Active Learning as an iterative process that starts from a small number of labeled seeding instances. The experimental results suggest that the semi-supervised methods fit the low-resource subevent identification problem well. Our best results were obtained by an ensemble of these methods. According to the official results, our approach proved the best for all the settings in this task.

源语言英语
主期刊名Natural Language Processing and Chinese Computing - 10th CCF International Conference, NLPCC 2021, Proceedings
编辑Lu Wang, Yansong Feng, Yu Hong, Ruifang He
出版商Springer Science and Business Media Deutschland GmbH
400-411
页数12
ISBN(印刷版)9783030884826
DOI
出版状态已出版 - 2021
活动10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021 - Qingdao, 中国
期限: 13 10月 202117 10月 2021

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13029 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021
国家/地区中国
Qingdao
时期13/10/2117/10/21

指纹

探究 'BIT-Event at NLPCC-2021 Task 3: Subevent Identification via Adversarial Training' 的科研主题。它们共同构成独一无二的指纹。

引用此