跳到主要导航 跳到搜索 跳到主要内容

Low latency parameter generation for real-time speech synthesis system

科研成果: 期刊稿件会议文章同行评审

摘要

Speech synthesizer is commonly used in human-computer interaction. In many applicational cases, the computing resource is limited while real-time synthesis is demanded. The HMM-based speech synthesis technique allows creating a natural voice quality with small footprint, but current synthesizers require the concatenation of sentence level acoustic units, which is not applicable in real-time mode. In this paper, we propose a blocked parameter generation algorithm for low latency speech synthesis which can work real-time in resource limited applications. Phonetic units at various time spans are used as blocks. The objective and subjective evaluations suggest that the proposed system produce promising voice quality with a low demand for the computing resource.

源语言英语
文章编号6890197
期刊Proceedings - IEEE International Conference on Multimedia and Expo
2014-September
Septmber
DOI
出版状态已出版 - 3 9月 2014
活动2014 IEEE International Conference on Multimedia and Expo, ICME 2014 - Chengdu, 中国
期限: 14 7月 201418 7月 2014

指纹

探究 'Low latency parameter generation for real-time speech synthesis system' 的科研主题。它们共同构成独一无二的指纹。

引用此