摘要
An approach for Chinese named entity identification using cascaded hidden Markov model, which aimed to incorporate person name, location name, organization name recognition into an integrated theoretical frame was presented. Simple named entity was recognized by lower HMM model after rough segmentation and complex named entity such as person name, location name and organization name was recognized by higher HMM model using role tagging. In the test on large realistic corpus, its F-l measure of person name, location name and organization name was 92.55%, 94.53% and 86.51%. In the first international word segmentation bakeoff held by SIGHAN (the ACL Special Interest Group on Chinese Language Processing) in 2003. ICTCLAS, which name entity identification base on this model achieved excellent score.
源语言 | 英语 |
---|---|
页(从-至) | 87-94 |
页数 | 8 |
期刊 | Tongxin Xuebao/Journal on Communications |
卷 | 27 |
期 | 2 |
出版状态 | 已出版 - 2月 2006 |
已对外发布 | 是 |