Forward Translation to Mix Data for Speech Translation

Zhipeng Wang; Hongjing Xu; Shuoying Chen; Yuhang Guo

doi:10.1145/3594409.3594415

Corresponding author: Yuhang Guo

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

End-to-end speech translation uses a model to translate speech in one language into text in another language. Currently, the main challenge in the field of speech translation is data scarcity. Existing work addresses this problem by using text information or applying data augmentation. However, these approaches focus on exploiting a single corpus and do not make full use of existing human-labeled data from different sources. In this paper, we introduce a simple method to address the data scarcity problem: training a model on a simple mixture of corpora and applying forward translation to expand the training set. We perform experiments on CoVoST v2 French-English and mTEDx French-English. Our experiments demonstrate that combining a mixture of speech translation corpora with forward translation can yield better results than the same method without mixing.

Original language	English
Title of host publication	ICIAI 2023 - 7th International Conference on Innovation in Artificial Intelligence
Publisher	Association for Computing Machinery
Pages	178-182
Number of pages	5
ISBN (Electronic)	9781450398398
DOIs	https://doi.org/10.1145/3594409.3594415
Publication status	Published - 3 Mar 2023
Event	7th International Conference on Innovation in Artificial Intelligence, ICIAI 2023 - Harbin, China; Duration: 3 Mar 2023 → 5 Mar 2023

Publication series

Name	ACM International Conference Proceeding Series

Conference

Conference	7th International Conference on Innovation in Artificial Intelligence, ICIAI 2023
Country/Territory	China
City	Harbin
Period	3/03/23 → 5/03/23

Keywords

Data scarcity
Domain adaption
Forward-translation
Speech translation

Access to Document

10.1145/3594409.3594415

Cite this

Wang, Z., Xu, H., Chen, S., & Guo, Y. (2023). Forward Translation to Mix Data for Speech Translation. In ICIAI 2023 - 7th International Conference on Innovation in Artificial Intelligence (pp. 178-182). (ACM International Conference Proceeding Series). Association for Computing Machinery. https://doi.org/10.1145/3594409.3594415

@inproceedings{c9cd2a1fe902463385f7a7ae5ca34368,
title = "Forward Translation to Mix Data for Speech Translation",
abstract = "End-to-end speech translation uses a model to translate speech in one language into text in another language. Currently, the main challenge in the field of speech translation is data scarcity. Existing work addresses this problem by using text information or applying data augmentation. However, these approaches focus on exploiting a single corpus and do not make full use of existing human-labeled data from different sources. In this paper, we introduce a simple method to address the data scarcity problem: training a model on a simple mixture of corpora and applying forward translation to expand the training set. We perform experiments on CoVoST v2 French-English and mTEDx French-English. Our experiments demonstrate that combining a mixture of speech translation corpora with forward translation can yield better results than the same method without mixing.",
keywords = "Data scarcity, Domain adaption, Forward-translation, Speech translation",
author = "Zhipeng Wang and Hongjing Xu and Shuoying Chen and Yuhang Guo",
note = "Publisher Copyright: {\textcopyright} 2023 Copyright held by the owner/author(s). Publication rights licensed to ACM.; 7th International Conference on Innovation in Artificial Intelligence, ICIAI 2023 ; Conference date: 03-03-2023 Through 05-03-2023",
year = "2023",
month = mar,
day = "3",
doi = "10.1145/3594409.3594415",
language = "English",
series = "ACM International Conference Proceeding Series",
publisher = "Association for Computing Machinery",
pages = "178--182",
booktitle = "ICIAI 2023 - 7th International Conference on Innovation in Artificial Intelligence",
}

Wang, Z, Xu, H, Chen, S & Guo, Y 2023, Forward Translation to Mix Data for Speech Translation. in ICIAI 2023 - 7th International Conference on Innovation in Artificial Intelligence. ACM International Conference Proceeding Series, Association for Computing Machinery, pp. 178-182, 7th International Conference on Innovation in Artificial Intelligence, ICIAI 2023, Harbin, China, 3/03/23. https://doi.org/10.1145/3594409.3594415

Forward Translation to Mix Data for Speech Translation. / Wang, Zhipeng; Xu, Hongjing; Chen, Shuoying et al.
ICIAI 2023 - 7th International Conference on Innovation in Artificial Intelligence. Association for Computing Machinery, 2023. p. 178-182 (ACM International Conference Proceeding Series).

TY - GEN
T1 - Forward Translation to Mix Data for Speech Translation
AU - Wang, Zhipeng
AU - Xu, Hongjing
AU - Chen, Shuoying
AU - Guo, Yuhang
PY - 2023/3/3
Y1 - 2023/3/3
AB - End-to-end speech translation uses a model to translate speech in one language into text in another language. Currently, the main challenge in the field of speech translation is data scarcity. Existing work addresses this problem by using text information or applying data augmentation. However, these approaches focus on exploiting a single corpus and do not make full use of existing human-labeled data from different sources. In this paper, we introduce a simple method to address the data scarcity problem: training a model on a simple mixture of corpora and applying forward translation to expand the training set. We perform experiments on CoVoST v2 French-English and mTEDx French-English. Our experiments demonstrate that combining a mixture of speech translation corpora with forward translation can yield better results than the same method without mixing.
KW - Data scarcity
KW - Domain adaption
KW - Forward-translation
KW - Speech translation
UR - http://www.scopus.com/inward/record.url?scp=85168866328&partnerID=8YFLogxK
U2 - 10.1145/3594409.3594415
DO - 10.1145/3594409.3594415
M3 - Conference contribution
AN - SCOPUS:85168866328
T3 - ACM International Conference Proceeding Series
SP - 178
EP - 182
BT - ICIAI 2023 - 7th International Conference on Innovation in Artificial Intelligence
PB - Association for Computing Machinery
T2 - 7th International Conference on Innovation in Artificial Intelligence, ICIAI 2023
Y2 - 3 March 2023 through 5 March 2023
ER -
