Introduction to BIT Chinese Spelling Correction System at CLP 2014 Bake-off

Research output: Contribution to conference › Paper › peer-review

4 Citations (Scopus)

Abstract

This paper describes the Chinese spelling correction system submitted by BIT to Task 2 of the CLP 2014 Bake-off. The system has two main parts. 1) An n-gram model is used to retrieve non-words, that is, misspelled words that word segmentation has wrongly split apart. These non-words are then corrected on the basis of word frequency, pronunciation similarity, shape similarity, and POS (part-of-speech) tags. 2) For wrong words, abnormal POS tags are used to locate them, and dependency relation matching is employed to correct them. Experimental results demonstrate the effectiveness of the system.
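As a rough illustration of the first part only, the sketch below shows how non-word candidates might be generated from confusion sets and ranked by word frequency and similarity type. It is not the BIT system: the lexicon, frequencies, confusion sets, weights, and the function names `candidates` and `correct` are all hypothetical, the n-gram retrieval step is replaced by a simple check for adjacent single-character tokens, and POS tags and dependency matching are not modeled.

```python
# Toy lexicon with made-up frequencies (illustrative assumption, not real data).
LEXICON = {"我们": 500, "今天": 300, "天气": 200, "很": 400, "好": 450}

# Hypothetical confusion sets: character -> {similar character: kind of similarity}.
CONFUSION = {
    "汽": {"气": "sound", "汔": "shape"},
    "气": {"汽": "sound"},
}

# Assumed weights: pronunciation matches are trusted slightly more than shape matches.
WEIGHT = {"sound": 1.0, "shape": 0.8}


def candidates(chunk):
    """Return (score, word) pairs for single-character substitutions that yield a lexicon word."""
    out = []
    for pos, ch in enumerate(chunk):
        for alt, kind in CONFUSION.get(ch, {}).items():
            word = chunk[:pos] + alt + chunk[pos + 1:]
            if word in LEXICON:
                out.append((LEXICON[word] * WEIGHT[kind], word))
    # Highest score first.
    return sorted(out, reverse=True)


def correct(tokens):
    """Merge two adjacent single-character tokens when a substitution makes them a known word."""
    result = []
    i = 0
    while i < len(tokens):
        pair_is_suspect = (
            i + 1 < len(tokens)
            and len(tokens[i]) == 1
            and len(tokens[i + 1]) == 1
        )
        if pair_is_suspect:
            ranked = candidates(tokens[i] + tokens[i + 1])
            if ranked:
                result.append(ranked[0][1])  # keep the best-scoring candidate
                i += 2
                continue
        result.append(tokens[i])
        i += 1
    return result


# A segmenter is assumed to have split the misspelled "天汽" into single characters.
segmented = ["今天", "天", "汽", "很", "好"]
corrected = correct(segmented)
print(corrected)
```

In this toy setup the fragments "天" and "汽" are merged into the lexicon word "天气", while "很" and "好" are left alone because no substitution turns them into a known word. Only two-character windows are considered here, a simplification chosen to keep the sketch short.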

Original language: English
Pages: 179-185
Number of pages: 7
Publication status: Published - 2014
Event: 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014 - Wuhan, China
Duration: 20 Oct 2014 to 21 Oct 2014

Conference

Conference: 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014
Country/Territory: China
City: Wuhan
Period: 20/10/14 to 21/10/14
