The application of CRFs in part-of-speech tagging

Xiaofei Zhang*, Heyan Huang, Liang Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 6
  • Captures
    • Readers: 13
see details

Abstract

Conditional random fields (CRFs) for sequence labeling offer advantages over both generative models like Hidden Markov model (HMM) and classifiers applied at each sequence position. First, the CRFs don't force to adhere to the independence assumption and thus can depend on arbitrary, non-independent features, without accounting for the distribution of those dependencies. Since CRFs models are able to flexibly utilize a wide variety of features, the training data sparse problem can be efficiently resolved. Moreover, the parameter estimation for CRFs is global, which effectively resolve the label bias problem. In this paper, the CRFs with Gaussian prior smoothing is used for Part-of-Speech (POS) tagging. Experiments show that the POS tagging error rate is reduced by 55.17% in close test and 43.64% in open test over HMM-based baseline, and synchronously an accuracy of 98.05% in close test and 95.79% in open test are also achieved. These positive results confirm CRFs superior performance.

Original languageEnglish
Title of host publication2009 International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2009
Pages347-350
Number of pages4
DOIs
Publication statusPublished - 2009
Event2009 International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2009 - Hangzhou, Zhejiang, China
Duration: 26 Aug 200927 Aug 2009

Publication series

Name2009 International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2009
Volume2

Conference

Conference2009 International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2009
Country/TerritoryChina
CityHangzhou, Zhejiang
Period26/08/0927/08/09

Keywords

  • CRF
  • HMM
  • Natural Language Processing (NLP)
  • POS tagging

Fingerprint

Dive into the research topics of 'The application of CRFs in part-of-speech tagging'. Together they form a unique fingerprint.

Cite this

Zhang, X., Huang, H., & Zhang, L. (2009). The application of CRFs in part-of-speech tagging. In 2009 International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2009 (pp. 347-350). Article 5335969 (2009 International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2009; Vol. 2). https://doi.org/10.1109/IHMSC.2009.210