Comic-guided speech synthesis

Yujia Wang, Wenguan Wang, Wei Liang*, Lap Fai Yu

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

17 引用 (Scopus)

摘要

We introduce a novel approach for synthesizing realistic speeches for comics. Using a comic page as input, our approach synthesizes speeches for each comic character following the reading flow. It adopts a cascading strategy to synthesize speeches in two stages: Comic Visual Analysis and Comic Speech Synthesis. In the first stage, the input comic page is analyzed to identify the gender and age of the characters, as well as texts each character speaks and corresponding emotion. Guided by this analysis, in the second stage, our approach synthesizes realistic speeches for each character, which are consistent with the visual observations. Our experiments show that the proposed approach can synthesize realistic and lively speeches for different types of comics. Perceptual studies performed on the synthesis results of multiple sample comics validate the efficacy of our approach.

源语言英语
文章编号187
期刊ACM Transactions on Graphics
38
6
DOI
出版状态已出版 - 11月 2019

指纹

探究 'Comic-guided speech synthesis' 的科研主题。它们共同构成独一无二的指纹。

引用此