Abstract
This paper describes the Chinese spelling correction system submitted by BIT at CLP Bake-off 2014 task 2. The system mainly includes two parts: 1) N-gram model is adopted to retrieve the non-words which are wrongly separated by word segmentation. The non-words are then corrected in terms of word frequency, pronunciation similarity, shape similarity and POS (part of speech) tag. 2) For wrong words, abnormal POS tag is used to indicate their location and dependency relation matching is employed to correct them. Experiment results demonstrate the effectiveness of our system.
Original language | English |
---|---|
Pages | 179-185 |
Number of pages | 7 |
Publication status | Published - 2014 |
Event | 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014 - Wuhan, China Duration: 20 Oct 2014 → 21 Oct 2014 |
Conference
Conference | 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014 |
---|---|
Country/Territory | China |
City | Wuhan |
Period | 20/10/14 → 21/10/14 |