跳到主要导航 跳到搜索 跳到主要内容

The Xiaomi AI Lab’s Speech Translation Systems for IWSLT 2023 Offline Task, Simultaneous Task and Speech-to-Speech Task

  • Wuwei Huang*
  • , Mengge Liu
  • , Xiang Li
  • , Yanzhi Tian
  • , Fengyu Yang
  • , Wen Zhang
  • , Yuhang Guo
  • , Jinsong Su
  • , Jian Luan
  • , Bin Wang
  • *此作品的通讯作者
  • Xiaomi
  • Beijing Institute of Technology
  • Xiamen University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This system description paper introduces the systems submitted by Xiaomi AI Lab to the three tracks of the IWSLT 2023 Evaluation Campaign, namely the offline speech translation (Offline-ST) track, the offline speech-to-speech translation (Offline-S2ST) track, and the simultaneous speech translation (Simul-ST) track. All our submissions for these three tracks only involve the English-Chinese language direction. Our English-Chinese speech translation systems are constructed using large-scale pre-trained models as the foundation. Specifically, we fine-tune these models’ corresponding components for various downstream speech translation tasks. Moreover, we implement several popular techniques, such as data filtering, data augmentation, speech segmentation, and model ensemble, to improve the system’s overall performance. Extensive experiments show that our systems achieve a significant improvement over the strong baseline systems in terms of the automatic evaluation metric.

源语言英语
主期刊名20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference
编辑Elizabeth Salesky, Marcello Federico, Marine Carpuat
出版商Association for Computational Linguistics
411-419
页数9
ISBN(电子版)9781959429845
出版状态已出版 - 2023
活动20th International Conference on Spoken Language Translation, IWSLT 2023 - Hybrid, Toronto, 加拿大
期限: 13 7月 202314 7月 2023

出版系列

姓名20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference

会议

会议20th International Conference on Spoken Language Translation, IWSLT 2023
国家/地区加拿大
Hybrid, Toronto
时期13/07/2314/07/23

指纹

探究 'The Xiaomi AI Lab’s Speech Translation Systems for IWSLT 2023 Offline Task, Simultaneous Task and Speech-to-Speech Task' 的科研主题。它们共同构成独一无二的指纹。

引用此