A survey on event extraction in new domains

Heyan Huang; Xiao Liu

doi:10.11992/tis.202109045

Heyan Huang^*, Xiao Liu

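^*Corresponding author for this work

School of Computer Science

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract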

In the current Internet era, large volumes of unstructured text in new domains contain abundant information. Research on event extraction in new domains can accelerate the construction of domain knowledge bases and support downstream knowledge-based applications. However, existing event extraction methods are substantially limited by domain. Building event extraction systems from scratch in a new domain depends heavily on the quality and scale of event schemas and annotated data, requiring considerable human effort and expertise. Moreover, multiple associated event instances often appear in the same context in these datasets, which heavily hinders event extraction and factuality prediction. This paper surveys the emerging research field of event extraction in new domains and reviews the current state of research along three directions: event schema induction, collective event extraction, and event factuality prediction. In addition, it discusses the existing difficulties and challenges and points out potential directions for future research.

Original language	English
Pages (from-to)	201-212
Number of pages	12
Journal	CAAI Transactions on Intelligent Systems
Volume	17
Issue number	1
DOI	https://doi.org/10.11992/tis.202109045
Publication status	Published - Jan 2022

Cite this

@article{e01e9b6c6163429799ba5829e7c685f1,

title = "A survey on event extraction in new domains",

abstract = "In the current Internet era, large volumes of unstructured text in new domains contain abundant information. Research on event extraction in new domains can accelerate the construction of domain knowledge bases and support downstream knowledge-based applications. However, existing event extraction methods are substantially limited by domain. Building event extraction systems from scratch in a new domain depends heavily on the quality and scale of event schemas and annotated data, requiring considerable human effort and expertise. Moreover, multiple associated event instances often appear in the same context in these datasets, which heavily hinders event extraction and factuality prediction. This paper surveys the emerging research field of event extraction in new domains and reviews the current state of research along three directions: event schema induction, collective event extraction, and event factuality prediction. In addition, it discusses the existing difficulties and challenges and points out potential directions for future research.",

keywords = "Collective extraction, Event extraction, Event factuality prediction, Event schema induction, Information extraction, Knowledge base, Natural language processing, New domains",

author = "Heyan Huang and Xiao Liu",

year = "2022",

month = jan,

doi = "10.11992/tis.202109045",

language = "English",

volume = "17",

pages = "201--212",

journal = "CAAI Transactions on Intelligent Systems",

issn = "1673-4785",

publisher = "Editorial Department of CAAI Transactions on Intelligent Systems",

number = "1",

}

TY - JOUR

T1 - A survey on event extraction in new domains

AU - Huang, Heyan

AU - Liu, Xiao

PY - 2022/1

Y1 - 2022/1

N2 - In the current Internet era, large volumes of unstructured text in new domains contain abundant information. Research on event extraction in new domains can accelerate the construction of domain knowledge bases and support downstream knowledge-based applications. However, existing event extraction methods are substantially limited by domain. Building event extraction systems from scratch in a new domain depends heavily on the quality and scale of event schemas and annotated data, requiring considerable human effort and expertise. Moreover, multiple associated event instances often appear in the same context in these datasets, which heavily hinders event extraction and factuality prediction. This paper surveys the emerging research field of event extraction in new domains and reviews the current state of research along three directions: event schema induction, collective event extraction, and event factuality prediction. In addition, it discusses the existing difficulties and challenges and points out potential directions for future research.

AB - In the current Internet era, large volumes of unstructured text in new domains contain abundant information. Research on event extraction in new domains can accelerate the construction of domain knowledge bases and support downstream knowledge-based applications. However, existing event extraction methods are substantially limited by domain. Building event extraction systems from scratch in a new domain depends heavily on the quality and scale of event schemas and annotated data, requiring considerable human effort and expertise. Moreover, multiple associated event instances often appear in the same context in these datasets, which heavily hinders event extraction and factuality prediction. This paper surveys the emerging research field of event extraction in new domains and reviews the current state of research along three directions: event schema induction, collective event extraction, and event factuality prediction. In addition, it discusses the existing difficulties and challenges and points out potential directions for future research.

KW - Collective extraction

KW - Event extraction

KW - Event factuality prediction

KW - Event schema induction

KW - Information extraction

KW - Knowledge base

KW - Natural language processing

KW - New domains

UR - http://www.scopus.com/inward/record.url?scp=85180224050&partnerID=8YFLogxK

U2 - 10.11992/tis.202109045

DO - 10.11992/tis.202109045

M3 - Article

AN - SCOPUS:85180224050

SN - 1673-4785

VL - 17

SP - 201

EP - 212

JO - CAAI Transactions on Intelligent Systems

JF - CAAI Transactions on Intelligent Systems

IS - 1

ER -
