TY - JOUR
T1 - A dependency parser for spontaneous Chinese spoken language
AU - He, Ruifang
AU - Wang, Yaru
AU - Song, Dawei
AU - Zhang, Peng
AU - Jia, Yuan
AU - Li, Aijun
N1 - Publisher Copyright:
© 2018 ACM.
PY - 2018/7
Y1 - 2018/7
N2 - Dependency analysis is vital for spoken language understanding in spoken dialogue systems. However, existing research has mainly focused on western spoken languages, Japanese, and so on. Little research has been done for spoken Chinese in terms of dependency parsing. Therefore, the new spoken corpus, D-ESCSC (Dependency-Expressive Speech Corpus of Standard Chinese) is built by adding new dependency relations special to spoken Chinese based on a written Chinese annotation scheme. Since spoken Chinese contains typical ill-grammatical phenomena, e.g., translocation, repetition, duplication, and omission, the new atom feature related to punctuation and three feature templates are proposed to improve a graph-based dependency parser. Experimental results on spoken Chinese corpus show that the atom feature and three templates really work and the new parser outperforms the baseline parser. To our best knowledge, it is the first work to report dependency parsing results of spoken Chinese.
AB - Dependency analysis is vital for spoken language understanding in spoken dialogue systems. However, existing research has mainly focused on western spoken languages, Japanese, and so on. Little research has been done for spoken Chinese in terms of dependency parsing. Therefore, the new spoken corpus, D-ESCSC (Dependency-Expressive Speech Corpus of Standard Chinese) is built by adding new dependency relations special to spoken Chinese based on a written Chinese annotation scheme. Since spoken Chinese contains typical ill-grammatical phenomena, e.g., translocation, repetition, duplication, and omission, the new atom feature related to punctuation and three feature templates are proposed to improve a graph-based dependency parser. Experimental results on spoken Chinese corpus show that the atom feature and three templates really work and the new parser outperforms the baseline parser. To our best knowledge, it is the first work to report dependency parsing results of spoken Chinese.
KW - Dependency parsing
KW - Graph-based model
KW - Spoken language
KW - Spontaneous Chinese
UR - http://www.scopus.com/inward/record.url?scp=85053373843&partnerID=8YFLogxK
U2 - 10.1145/3196278
DO - 10.1145/3196278
M3 - Article
AN - SCOPUS:85053373843
SN - 2375-4699
VL - 17
JO - ACM Transactions on Asian and Low-Resource Language Information Processing
JF - ACM Transactions on Asian and Low-Resource Language Information Processing
IS - 4
M1 - 28
ER -