TY - GEN
T1 - Integrating tonal information into Mandarin name recognition with different strategies
AU - Luo, Dong Sheng
AU - Xie, Xiang
AU - Kuang, Jing Ming
PY - 2004
Y1 - 2004
N2 - Name recognition is a practical application of speech recognition technology. As Chinese is well known to be a tonal language, tonal information has important influence on this task. In this paper we integrate tonal information into a speaker-independent Mandarin name recognizer, and two combination strategies: feature combination and posterior combination are investigated firstly. The recognizer is evaluated on an extremely challenging Mandarin name corpus, which includes 100 tonally confusing pairs. Although a significant improvement in the recognition accuracy can be achieved with either strategy, the system has a poor flexibility. Based on the analysis of the experiment results we propose a two-step process to improve the system performance further. It is shown that a maximal improvement of 29.96% in word accuracy can be achieved. At the same time the system has a good flexibility with tonal information being integrated dynamically.
AB - Name recognition is a practical application of speech recognition technology. As Chinese is well known to be a tonal language, tonal information has important influence on this task. In this paper we integrate tonal information into a speaker-independent Mandarin name recognizer, and two combination strategies: feature combination and posterior combination are investigated firstly. The recognizer is evaluated on an extremely challenging Mandarin name corpus, which includes 100 tonally confusing pairs. Although a significant improvement in the recognition accuracy can be achieved with either strategy, the system has a poor flexibility. Based on the analysis of the experiment results we propose a two-step process to improve the system performance further. It is shown that a maximal improvement of 29.96% in word accuracy can be achieved. At the same time the system has a good flexibility with tonal information being integrated dynamically.
UR - https://www.scopus.com/pages/publications/21444440901
M3 - Conference contribution
AN - SCOPUS:21444440901
SN - 0780386787
SN - 9780780386785
T3 - 2004 International Symposium on Chinese Spoken Language Processing - Proceedings
SP - 265
EP - 268
BT - 2004 International Symposium on Chinese Spoken Language Processing - Proceedings
T2 - 2004 International Symposium on Chinese Spoken Language Processing
Y2 - 15 December 2004 through 18 December 2004
ER -