The Xiaomi Text-to-Text Simultaneous Speech Translation System for IWSLT 2022

Bao Guo, Mengge Liu, Wen Zhang, Hexuan Chen, Chang Mu, Xiang Li, Jianwei Cui, Bin Wang, Yuhang Guo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

This system paper describes the Xiaomi Translation System for the IWSLT 2022 Simultaneous Speech Translation (noted as SST) shared task. We participate in the English-to-Mandarin Chinese Text-to-Text (noted as T2T) track. Our system is built based on the Transformer model with novel techniques borrowed from our recent research work. For the data filtering, language-model-based and rule-based methods are conducted to filter the data to obtain high-quality bilingual parallel corpora. We also strengthen our system with some dominating techniques related to data augmentation, such as knowledge distillation, tagged back-translation, and iterative back-translation. We also incorporate novel training techniques such as R-drop, deep model, and large batch training which have been shown to be beneficial to the naive Transformer model. In the SST scenario, several variations of wait-k strategies are explored. Furthermore, in terms of robustness, both data-based and model-based ways are used to reduce the sensitivity of our system to Automatic Speech Recognition (ASR) outputs. We finally design some inference algorithms and use the adaptive-ensemble method based on multiple model variants to further improve the performance of the system. Compared with strong baselines, fusing all techniques can improve our system by 2~3 BLEU scores under different latency regimes.

Original languageEnglish
Title of host publicationIWSLT 2022 - 19th International Conference on Spoken Language Translation, Proceedings of the Conference
EditorsElizabeth Salesky, Marcello Federico, Marta Costa-Jussa
PublisherAssociation for Computational Linguistics (ACL)
Pages216-224
Number of pages9
ISBN (Electronic)9781955917414
Publication statusPublished - 2022
Event19th International Conference on Spoken Language Translation, IWSLT 2022 - Dublin, Ireland
Duration: 26 May 202227 May 2022

Publication series

NameIWSLT 2022 - 19th International Conference on Spoken Language Translation, Proceedings of the Conference

Conference

Conference19th International Conference on Spoken Language Translation, IWSLT 2022
Country/TerritoryIreland
CityDublin
Period26/05/2227/05/22

Fingerprint

Dive into the research topics of 'The Xiaomi Text-to-Text Simultaneous Speech Translation System for IWSLT 2022'. Together they form a unique fingerprint.

Cite this