BIT-Xiaomi’s Simultaneous Translation System for AutoSimTrans 2022

Mengge Liu, Xiang Li, Bao Chen, Yanzhi Tian, Tianwei Lan, Silin Li, Yuhang Guo*, Jian Luan, Bin Wang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This system paper describes the BIT-Xiaomi simultaneous translation system for Autosimtrans 2022 simultaneous translation challenge. We participated in three tracks: the Zh-En text-to-text track, the Zh-En audio-to-text track, and the En-Es test-to-text track. In our system, wait-k is utilized to train prefix-to-prefix translation models. We integrate streaming chunking to detect segmentation boundaries as the source streaming reading in. We further improve our system with data selection, data augmentation, and R-Drop training methods. Results show that our wait-k implementation outperforms the organizer’s baseline by at most 8 BLEU score and our proposed streaming chunking method further improves by about 2 BLEU score in the low latency regime.

源语言英语
主期刊名AutoSimTrans 2022 - Automatic Simultaneous Translation Challenges, Recent Advances, and Future Directions, Proceedings of the 3rd Workshop
编辑Julia Ive, Ruiqing Zhang
出版商Association for Computational Linguistics (ACL)
34-42
页数9
ISBN(电子版)9781955917964
出版状态已出版 - 2022
活动3rd Workshop on Automatic Simultaneous Translation Challenges, Recent Advances, and Future Directions, AutoSimTrans 2022 - Virtual, Online
期限: 15 7月 202216 7月 2022

出版系列

姓名AutoSimTrans 2022 - Automatic Simultaneous Translation Challenges, Recent Advances, and Future Directions, Proceedings of the 3rd Workshop

会议

会议3rd Workshop on Automatic Simultaneous Translation Challenges, Recent Advances, and Future Directions, AutoSimTrans 2022
Virtual, Online
时期15/07/2216/07/22

指纹

探究 'BIT-Xiaomi’s Simultaneous Translation System for AutoSimTrans 2022' 的科研主题。它们共同构成独一无二的指纹。

引用此