Introduction to BIT Chinese Spelling Correction System at CLP 2014 Bake-off

Min Liu; Ping Jian; Heyan Huang

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Contribution to conference › Paper › peer-review

4 Citations (Scopus)

Abstract

This paper describes the Chinese spelling correction system submitted by BIT to Task 2 of the CLP 2014 Bake-off. The system has two main parts: 1) An n-gram model is used to detect non-words, which word segmentation wrongly splits into separate fragments. The non-words are then corrected according to word frequency, pronunciation similarity, shape similarity, and POS (part-of-speech) tags. 2) For wrong words, abnormal POS tags are used to locate them, and dependency relation matching is used to correct them. Experimental results demonstrate the effectiveness of our system.
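The first part of the abstract (locating non-words that segmentation has split apart, then choosing a replacement by similarity and frequency) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `LEXICON`, `CONFUSIONS`, `best_candidate`, and `correct_nonwords` are hypothetical, the toy data is invented for illustration, a simple "two adjacent single-character tokens" test stands in for the n-gram detector, and a single merged confusion set stands in for separate pronunciation and shape similarity. POS tags are not modeled.

```python
from typing import Dict, List, Optional, Set

# Hypothetical toy resources, invented for illustration (not from the paper).
# LEXICON maps known words to corpus frequencies.
LEXICON: Dict[str, int] = {"今天": 800, "天气": 500, "很": 900, "好": 950}
# CONFUSIONS maps a character to characters that sound or look similar.
CONFUSIONS: Dict[str, Set[str]] = {"汽": {"气", "器"}, "天": {"田", "填"}}


def best_candidate(
    fragment: str,
    lexicon: Dict[str, int],
    confusions: Dict[str, Set[str]],
) -> Optional[str]:
    """Return the most frequent lexicon word reachable from `fragment`
    by replacing one character with a confusable one, or None."""
    candidates: List[str] = []
    for pos, ch in enumerate(fragment):
        for rep in confusions.get(ch, set()):
            cand = fragment[:pos] + rep + fragment[pos + 1:]
            if cand in lexicon:
                candidates.append(cand)
    if not candidates:
        return None
    return max(candidates, key=lambda w: lexicon[w])


def correct_nonwords(
    tokens: List[str],
    lexicon: Dict[str, int],
    confusions: Dict[str, Set[str]],
) -> List[str]:
    """Merge adjacent single-character tokens when a one-character
    substitution turns them into a known word."""
    out: List[str] = []
    i = 0
    while i < len(tokens):
        pair_of_singles = (
            i + 1 < len(tokens)
            and len(tokens[i]) == 1
            and len(tokens[i + 1]) == 1
        )
        if pair_of_singles:
            fixed = best_candidate(tokens[i] + tokens[i + 1], lexicon, confusions)
            if fixed is not None:
                out.append(fixed)
                i += 2
                continue
        out.append(tokens[i])
        i += 1
    return out


# A segmenter has split the misspelled "天汽" into two single characters.
segmented = ["今天", "天", "汽", "很", "好"]
print(correct_nonwords(segmented, LEXICON, CONFUSIONS))
# prints ['今天', '天气', '很', '好']
```

In this sketch the pair "很", "好" is left alone because no single substitution yields a lexicon word, which mirrors the idea that only fragments with a plausible correction are rewritten.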

Original language	English
Pages	179-185
Number of pages	7
Publication status	Published - 2014
Event	3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014 - Wuhan, China
Duration	20 Oct 2014 → 21 Oct 2014

Conference

Conference	3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014
Country/Territory	China
City	Wuhan
Period	20/10/14 → 21/10/14

Cite this

@conference{bd60063946914ed3818368aa9e0a4bf7,

title = "Introduction to BIT Chinese Spelling Correction System at CLP 2014 Bake-off",

abstract = "This paper describes the Chinese spelling correction system submitted by BIT at CLP Bake-off 2014 task 2. The system mainly includes two parts: 1) N-gram model is adopted to retrieve the non-words which are wrongly separated by word segmentation. The non-words are then corrected in terms of word frequency, pronunciation similarity, shape similarity and POS (part of speech) tag. 2) For wrong words, abnormal POS tag is used to indicate their location and dependency relation matching is employed to correct them. Experiment results demonstrate the effectiveness of our system.",

author = "Min Liu and Ping Jian and Heyan Huang",

note = "Publisher Copyright: {\textcopyright} 2014 CLP 2014 - 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing. All rights reserved.; 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014 ; Conference date: 20-10-2014 Through 21-10-2014",

year = "2014",

language = "English",

pages = "179--185",

}

TY - CONF

T1 - Introduction to BIT Chinese Spelling Correction System at CLP 2014 Bake-off

AU - Liu, Min

AU - Jian, Ping

AU - Huang, Heyan

PY - 2014

Y1 - 2014

N2 - This paper describes the Chinese spelling correction system submitted by BIT at CLP Bake-off 2014 task 2. The system mainly includes two parts: 1) N-gram model is adopted to retrieve the non-words which are wrongly separated by word segmentation. The non-words are then corrected in terms of word frequency, pronunciation similarity, shape similarity and POS (part of speech) tag. 2) For wrong words, abnormal POS tag is used to indicate their location and dependency relation matching is employed to correct them. Experiment results demonstrate the effectiveness of our system.

AB - This paper describes the Chinese spelling correction system submitted by BIT at CLP Bake-off 2014 task 2. The system mainly includes two parts: 1) N-gram model is adopted to retrieve the non-words which are wrongly separated by word segmentation. The non-words are then corrected in terms of word frequency, pronunciation similarity, shape similarity and POS (part of speech) tag. 2) For wrong words, abnormal POS tag is used to indicate their location and dependency relation matching is employed to correct them. Experiment results demonstrate the effectiveness of our system.

UR - http://www.scopus.com/inward/record.url?scp=84989188767&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:84989188767

SP - 179

EP - 185

T2 - 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014

Y2 - 20 October 2014 through 21 October 2014

ER -
