Forward Translation to Mix Data for Speech Translation

Zhipeng Wang, Hongjing Xu, Shuoying Chen, Yuhang Guo*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

End-to-End speech translation means that using a model to translate speech in one language into text in another language. Currently, the main challenge in the field of speech translation is data scarcity. Existing works solve this problem by using text information or applying data augmentation. However, these works only focus on the exploitation of a single corpus, ignoring the full use of existing human-labeled different-sources data. In this paper, we introduce a simple method to solve the data scarcity problem: training a model with simply mixed data and applying the forward translation method to expand the training set. We perform experiments on covost v2 French-English and mTEDx French-English. Our experiments demonstrate that combining the mixture of speech translation corpora with forward translation can yield a better result than the method without mixing.

源语言英语
主期刊名ICIAI 2023 - 7th International Conference on Innovation in Artificial Intelligence
出版商Association for Computing Machinery
178-182
页数5
ISBN(电子版)9781450398398
DOI
出版状态已出版 - 3 3月 2023
活动7th International Conference on Innovation in Artificial Intelligence, ICIAI 2023 - Harbin, 中国
期限: 3 3月 20235 3月 2023

出版系列

姓名ACM International Conference Proceeding Series

会议

会议7th International Conference on Innovation in Artificial Intelligence, ICIAI 2023
国家/地区中国
Harbin
时期3/03/235/03/23

指纹

探究 'Forward Translation to Mix Data for Speech Translation' 的科研主题。它们共同构成独一无二的指纹。

引用此

Wang, Z., Xu, H., Chen, S., & Guo, Y. (2023). Forward Translation to Mix Data for Speech Translation. 在 ICIAI 2023 - 7th International Conference on Innovation in Artificial Intelligence (页码 178-182). (ACM International Conference Proceeding Series). Association for Computing Machinery. https://doi.org/10.1145/3594409.3594415