GFN: A novel joint entity and relation extraction model with redundancy and denoising strategies

Xin Sun*, Qiyi Guo, Shi Qi Ge

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Joint entity and relation extraction refers to the extraction of entities and their corresponding relationships in the given sentence, which has gained increasing attention in recent years. Some joint extraction models utilize a shared encoder to model the interactions between named entity recognition and relation extraction subtasks. Despite achieving decent performance, they inevitably face the issue of error propagation. One-step exhaustive methods can mitigate the error propagation problem to some extent, but they suffer from issues such as huge computation complexity and a proliferation of negative samples. Therefore, addressing the problems mentioned above, we propose a Greedy Filter Network that combines Greedy-NER and Filter-RE. GFN employs the Greedy-NER with a redundancy strategy to prioritize recall, thereby reducing error propagation between subtasks. To reduce the computational complexity, we design an innovative approach to represent and store spans in Greedy-NER. In Filter-RE, we traverse all pairwise combinations of candidate entities. To address the issue of widespread negative samples, we design a denoising strategy with two filters, effectively filtering out entity pairs without relations, which can eliminate noise and alleviate the issue of negative sample proliferation. Finally, to enable flexible control over the redundancy strategy, we design two misclassifying penalty parameters for each module. The experimental results indicate that GFN achieves the state-of-the-art F1-score on the CoNLL04 and NYT datasets, with a notable 2.0% improvement observed specifically on CoNLL04.

Original languageEnglish
Article number112137
JournalKnowledge-Based Systems
Volume300
DOIs
Publication statusPublished - 27 Sept 2024

Keywords

  • Error propagation
  • Information extraction
  • Joint entity and relation extraction
  • Overlapping relations

Fingerprint

Dive into the research topics of 'GFN: A novel joint entity and relation extraction model with redundancy and denoising strategies'. Together they form a unique fingerprint.

Cite this