Teaching Large Language Models to Translate on Low-resource Languages with Textbook Prompting

Ping Guo, Yubing Ren, Yue Hu*, Yunpeng Li, Jiarui Zhang, Xingsheng Zhang, Heyan Huang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Citations (Scopus)

Abstract

Large Language Models (Llms) have achieved impressive results in Machine Translation by simply following instructions, even without training on parallel data. However, Llms still face challenges on low-resource languages due to the lack of pre-training data. In real-world situations, humans can become proficient in their native languages through abundant and meaningful social interactions and can also learn foreign languages effectively using well-organized textbooks. Drawing inspiration from human learning patterns, we introduce the Translate After LEarNing Textbook (Talent) approach, which aims to enhance Llms' ability to translate low-resource languages by learning from a textbook. Talent follows a step-by-step process: (1) Creating a Textbook for low-resource languages. (2) Guiding Llms to absorb the Textbook's content for Syntax Patterns. (3) Enhancing translation by utilizing the Textbook and Syntax Patterns. We thoroughly assess Talent's performance using 112 low-resource languages from FLORES-200 with two Llms: ChatGPT and BLOOMZ. Evaluation across three different metrics reveals that Talent consistently enhances translation performance by 14.8% compared to zero-shot baselines. Further analysis demonstrates that Talent not only improves Llms' comprehension of low-resource languages but also equips them with the knowledge needed to generate accurate and fluent sentences in these languages.

Original languageEnglish
Title of host publication2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings
EditorsNicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
PublisherEuropean Language Resources Association (ELRA)
Pages15685-15697
Number of pages13
ISBN (Electronic)9782493814104
Publication statusPublished - 2024
EventJoint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024 - Hybrid, Torino, Italy
Duration: 20 May 202425 May 2024

Publication series

Name2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings

Conference

ConferenceJoint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024
Country/TerritoryItaly
CityHybrid, Torino
Period20/05/2425/05/24

Keywords

  • Large Language Models
  • Low-resource Language Evaluation
  • Multilingual Machine Translation

Fingerprint

Dive into the research topics of 'Teaching Large Language Models to Translate on Low-resource Languages with Textbook Prompting'. Together they form a unique fingerprint.

Cite this