Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

Heng Da Xu, Zhongli Li, Qingyu Zhou*, Chao Li, Zizhen Wang, Yunbo Cao, Heyan Huang, Xian Ling Mao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

55 Citations (Scopus)

Abstract

Chinese Spell Checking (CSC) aims to detect and correct erroneous characters in user-generated Chinese text. Most Chinese spelling errors arise from misusing semantically, phonetically, or graphically similar characters. Previous attempts have noticed this phenomenon and tried to exploit the similarity relationship for the task; however, these methods rely either on heuristics or on handcrafted confusion sets to predict the correct character. In this paper, we propose a Chinese spell checker called REALISE that directly leverages the multimodal information of Chinese characters. The REALISE model tackles the CSC task by (1) capturing the semantic, phonetic, and graphic information of the input characters, and (2) selectively mixing the information in these modalities to predict the correct output. Experiments on the SIGHAN benchmarks show that the proposed model outperforms strong baselines by a large margin.
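Step (2) of the abstract, selectively mixing the modalities, can be pictured as a gated fusion over the three per-character representations. Below is a minimal PyTorch sketch, assuming same-sized semantic, phonetic, and graphic vectors for each character; the `SelectiveFusion` module name and the exact gate parameterization are illustrative assumptions, not the paper's verbatim implementation.

```python
import torch
import torch.nn as nn

class SelectiveFusion(nn.Module):
    """Gated mixing of semantic, phonetic, and graphic character features.

    Hypothetical sketch: one sigmoid gate per non-semantic modality,
    conditioned on the semantic features, decides how much of that
    modality to mix into the final representation.
    """

    def __init__(self, hidden_size: int):
        super().__init__()
        self.phonetic_gate = nn.Linear(2 * hidden_size, hidden_size)
        self.graphic_gate = nn.Linear(2 * hidden_size, hidden_size)

    def forward(self, semantic, phonetic, graphic):
        # All inputs: (batch, seq_len, hidden_size).
        g_p = torch.sigmoid(self.phonetic_gate(torch.cat([semantic, phonetic], dim=-1)))
        g_g = torch.sigmoid(self.graphic_gate(torch.cat([semantic, graphic], dim=-1)))
        # Keep the semantic signal and add gated phonetic/graphic evidence.
        return semantic + g_p * phonetic + g_g * graphic

# Example: fuse encodings for a batch of 2 sentences of length 5.
fusion = SelectiveFusion(hidden_size=768)
sem, pho, gra = (torch.randn(2, 5, 768) for _ in range(3))
mixed = fusion(sem, pho, gra)  # (2, 5, 768), fed to the correction classifier
```

The design intuition is that the textual (semantic) channel carries most of the signal, so each gate decides, per dimension, how much phonetic or graphic evidence to mix in when a character may have been confused with a similar-sounding or similar-looking one.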

Original language: English
Title of host publication: Findings of the Association for Computational Linguistics
Subtitle of host publication: ACL-IJCNLP 2021
Editors: Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli
Publisher: Association for Computational Linguistics (ACL)
Pages: 716-728
Number of pages: 13
ISBN (Electronic): 9781954085541
Publication status: Published - 2021
Event: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 - Virtual, Online
Duration: 1 Aug 2021 - 6 Aug 2021

Publication series

Name: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

Conference

Conference: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
City: Virtual, Online
Period: 1/08/21 - 6/08/21
