摘要
Speech synthesizer is commonly used in human-computer interaction. In many applicational cases, the computing resource is limited while real-time synthesis is demanded. The HMM-based speech synthesis technique allows creating a natural voice quality with small footprint, but current synthesizers require the concatenation of sentence level acoustic units, which is not applicable in real-time mode. In this paper, we propose a blocked parameter generation algorithm for low latency speech synthesis which can work real-time in resource limited applications. Phonetic units at various time spans are used as blocks. The objective and subjective evaluations suggest that the proposed system produce promising voice quality with a low demand for the computing resource.
| 源语言 | 英语 |
|---|---|
| 文章编号 | 6890197 |
| 期刊 | Proceedings - IEEE International Conference on Multimedia and Expo |
| 卷 | 2014-September |
| 期 | Septmber |
| DOI | |
| 出版状态 | 已出版 - 3 9月 2014 |
| 活动 | 2014 IEEE International Conference on Multimedia and Expo, ICME 2014 - Chengdu, 中国 期限: 14 7月 2014 → 18 7月 2014 |
指纹
探究 'Low latency parameter generation for real-time speech synthesis system' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver