TY - GEN
T1 - Review-Based Curriculum Learning for Neural Machine Translation
AU - Hui, Ziyang
AU - Feng, Chong
AU - Zhang, Tianfu
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - For Neural Machine Translation (NMT) tasks with limited domain resources, curriculum learning provides a way to simulate the human learning process from simple to difficult to adapt the general NMT model to a specific domain. However, previous curriculum learning methods suffer from catastrophic forgetting and learning inefficiency. In this paper, we introduce a review-based curriculum learning method, targetedly selecting curriculum according to long time interval or unskilled mastery. Furthermore, we add general domain data to curriculum learning, using the mixed fine-tuning method, to improve generalization and robustness of translation. Extensive experimental results and analysis show that our method outperforms other curriculum learning baselines across three specific domains.
AB - For Neural Machine Translation (NMT) tasks with limited domain resources, curriculum learning provides a way to simulate the human learning process from simple to difficult to adapt the general NMT model to a specific domain. However, previous curriculum learning methods suffer from catastrophic forgetting and learning inefficiency. In this paper, we introduce a review-based curriculum learning method, targetedly selecting curriculum according to long time interval or unskilled mastery. Furthermore, we add general domain data to curriculum learning, using the mixed fine-tuning method, to improve generalization and robustness of translation. Extensive experimental results and analysis show that our method outperforms other curriculum learning baselines across three specific domains.
KW - Domain adaptation
KW - Neural machine translation
KW - Review-based curriculum learning
UR - http://www.scopus.com/inward/record.url?scp=85145255338&partnerID=8YFLogxK
U2 - 10.1007/978-981-19-7960-6_3
DO - 10.1007/978-981-19-7960-6_3
M3 - Conference contribution
AN - SCOPUS:85145255338
SN - 9789811979590
T3 - Communications in Computer and Information Science
SP - 24
EP - 36
BT - Machine Translation - 18th China Conference, CCMT 2022, Revised Selected Papers
A2 - Xiao, Tong
A2 - Pino, Juan
PB - Springer Science and Business Media Deutschland GmbH
T2 - 18th China Conference on Machine Translation, CCMT 2022
Y2 - 6 August 2022 through 10 August 2022
ER -