BIT’s System for Multilingual Track

Zhipeng Wang, Yuhang Guo*, Shuoying Chen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

This paper describes the system we submitted to the IWSLT 2023 multilingual speech translation track, with the input is speech from one language, and the output is text from 10 target languages. Our system consists of CNN and Transformer, convolutional neural networks downsample speech features and extract local information, while transformer extract global features and output the final results. In our system, we use speech recognition tasks to pre-train encoder parameters, and then use speech translation corpus to train the multilingual speech translation model. We have also adopted other methods to optimize the model, such as data augmentation, model ensemble, etc. Our system can obtain satisfactory results on test sets of 10 languages in the MUST-C corpus.

Original languageEnglish
Title of host publication20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference
EditorsElizabeth Salesky, Marcello Federico, Marine Carpuat
PublisherAssociation for Computational Linguistics
Pages455-460
Number of pages6
ISBN (Electronic)9781959429845
Publication statusPublished - 2023
Event20th International Conference on Spoken Language Translation, IWSLT 2023 - Hybrid, Toronto, Canada
Duration: 13 Jul 202314 Jul 2023

Publication series

Name20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference

Conference

Conference20th International Conference on Spoken Language Translation, IWSLT 2023
Country/TerritoryCanada
CityHybrid, Toronto
Period13/07/2314/07/23

Fingerprint

Dive into the research topics of 'BIT’s System for Multilingual Track'. Together they form a unique fingerprint.

Cite this