Implicit Discourse Relation Recognition by Scoring Special Tokens

Mingyang Cai*, Ping Jian, Yuhang Tian, Hai Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Implicit discourse relation recognition is one of the most difficult tasks in natural language understanding. Because of the lack of explicit connectives, the ability to extract logical information between the two discourse arguments is highly required. Previous studies mainly focus on designing complex neural network layers and various kinds of interactions between the two arguments, without further exploring the logical semantics in the pre-Trained language models. We propose a novel method that utilizes the power of the pre-Trained language model (RoBERTa) by introducing different kinds of extra special tokens, which can represent the relations between the discourse arguments respectively. On one hand, these special tokens learn to aggregate category features that profit classification; on the other hand, they can also be regarded as new 'words' that the prediction of them inspires the pre-Train language model. To effectively learn these special tokens, a scorer is then trained to give the discourse connected by corresponding special tokens higher scores than those with other special tokens. The experiments show the result of our approach exceeds our baseline by 4.24% F1, and the state-of-The-Art model by approximately 2.37% F1 on the four-way classification of the PDTB 2.0 dataset.

Original languageEnglish
Title of host publicationProceedings - 2023 3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023
EditorsWeijian Liu, Zhuo Zheng Wang, Peng You
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages407-413
Number of pages7
ISBN (Electronic)9798350302356
DOIs
Publication statusPublished - 2023
Event3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023 - Yangzhou, China
Duration: 26 May 202329 May 2023

Publication series

NameProceedings - 2023 3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023

Conference

Conference3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023
Country/TerritoryChina
CityYangzhou
Period26/05/2329/05/23

Keywords

  • Implicit Discourse Relation Recognition
  • Natural Language Processing

Fingerprint

Dive into the research topics of 'Implicit Discourse Relation Recognition by Scoring Special Tokens'. Together they form a unique fingerprint.

Cite this