TY - JOUR
T1 - A survey on event extraction in new domains
AU - Huang, Heyan
AU - Liu, Xiao
N1 - Publisher Copyright:
© 2022, Editorial Department of CAAI Transactions on Intelligent Systems. All rights reserved.
PY - 2022/1
Y1 - 2022/1
N2 - In the current Internet era, large volumes of unstructured text in new domains contain a large amount of information. Research on event extraction in new domains can accelerate the construction of domain knowledge bases and support downstream knowledge-based applications. However, existing event extraction methods are substantially limited by domain. Building an event extraction system from scratch in a new domain depends heavily on the quality and scale of event schemas and annotated data, which requires considerable human effort and expertise. Moreover, multiple associated event instances often appear in the same context in these datasets, which severely hinders event extraction and factuality prediction. This paper surveys the emerging research field of event extraction in new domains and reviews the current state of research along three directions: event schema induction, collective event extraction, and event factuality prediction. It also discusses the remaining difficulties and challenges and points out potential directions for future research.
AB - In the current Internet era, large volumes of unstructured text in new domains contain a large amount of information. Research on event extraction in new domains can accelerate the construction of domain knowledge bases and support downstream knowledge-based applications. However, existing event extraction methods are substantially limited by domain. Building an event extraction system from scratch in a new domain depends heavily on the quality and scale of event schemas and annotated data, which requires considerable human effort and expertise. Moreover, multiple associated event instances often appear in the same context in these datasets, which severely hinders event extraction and factuality prediction. This paper surveys the emerging research field of event extraction in new domains and reviews the current state of research along three directions: event schema induction, collective event extraction, and event factuality prediction. It also discusses the remaining difficulties and challenges and points out potential directions for future research.
KW - Collective extraction
KW - Event extraction
KW - Event factuality prediction
KW - Event schema induction
KW - Information extraction
KW - Knowledge base
KW - Natural language processing
KW - New domains
UR - http://www.scopus.com/inward/record.url?scp=85180224050&partnerID=8YFLogxK
U2 - 10.11992/tis.202109045
DO - 10.11992/tis.202109045
M3 - Article
AN - SCOPUS:85180224050
SN - 1673-4785
VL - 17
SP - 201
EP - 212
JO - CAAI Transactions on Intelligent Systems
JF - CAAI Transactions on Intelligent Systems
IS - 1
ER -