A dependency parser for spontaneous Chinese spoken language

Ruifang He*, Yaru Wang, Dawei Song, Peng Zhang, Yuan Jia, Aijun Li

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 3
  • Captures
    • Readers: 8
see details

Abstract

Dependency analysis is vital for spoken language understanding in spoken dialogue systems. However, existing research has mainly focused on western spoken languages, Japanese, and so on. Little research has been done for spoken Chinese in terms of dependency parsing. Therefore, the new spoken corpus, D-ESCSC (Dependency-Expressive Speech Corpus of Standard Chinese) is built by adding new dependency relations special to spoken Chinese based on a written Chinese annotation scheme. Since spoken Chinese contains typical ill-grammatical phenomena, e.g., translocation, repetition, duplication, and omission, the new atom feature related to punctuation and three feature templates are proposed to improve a graph-based dependency parser. Experimental results on spoken Chinese corpus show that the atom feature and three templates really work and the new parser outperforms the baseline parser. To our best knowledge, it is the first work to report dependency parsing results of spoken Chinese.

Original languageEnglish
Article number28
JournalACM Transactions on Asian and Low-Resource Language Information Processing
Volume17
Issue number4
DOIs
Publication statusPublished - Jul 2018

Keywords

  • Dependency parsing
  • Graph-based model
  • Spoken language
  • Spontaneous Chinese

Fingerprint

Dive into the research topics of 'A dependency parser for spontaneous Chinese spoken language'. Together they form a unique fingerprint.

Cite this

He, R., Wang, Y., Song, D., Zhang, P., Jia, Y., & Li, A. (2018). A dependency parser for spontaneous Chinese spoken language. ACM Transactions on Asian and Low-Resource Language Information Processing, 17(4), Article 28. https://doi.org/10.1145/3196278