Skip to main navigation Skip to search Skip to main content

Irish-based Large Language Model with Extreme Low-Resource Settings in Machine Translation

Research output: Chapter in Book/Report/Conference proceedingsConference proceedingpeer-review

Abstract

Large Language Models (LLMs) have demonstrated exceptional performances in a wide range of natural language processing tasks. However, their success does not always extend to machine translation, particularly in challenging scenarios such as translating low-resource languages. This study investigates the multilingual capability of LLMs, with a case study on Irish, an extremely low-resource language, focusing on translation tasks between English and Irish. We propose a dynamic, efficient language adaptation framework for English-centric LLMs, which involves layer-specific adjustments and subsequent fine-tuning for machine translation. Our findings highlight several key insights: (1) different layers in the LLM serve distinct functions such as language understanding and task reasoning, (2) effective translation requires extensive pre-training on both source and target languages, and (3) targeted fine-tuning for machine translation leads to significant improvements of 36.7% for English to Irish and 133.4% for Irish to English compared to the previous state-of-the-art.

Original languageEnglish
Title of host publicationLoResMT 2024 - 7th Workshop on Technologies for Machine Translation of Low-Resource Languages, Proceedings of the Workshop
EditorsAtul Kr. Ojha, Atul Kr. Ojha, Chao-hong Liu, Ekaterina Vylomova, Flammie Pirinen, Jade Abbott, Jonathan Washington, Nathaniel Oco, Valentin Malykh, Varvara Skolkovo Logacheva, Xiaobing Zhao
PublisherAssociation for Computational Linguistics (ACL)
Pages85-93
Number of pages9
ISBN (Electronic)9798891761490
Publication statusPublished - 2024
Event7th Workshop on Technologies for Machine Translation of Low-Resource Languages, LoResMT 2024 at ACL 2024 - Bangkok, Thailand
Duration: 15 Aug 2024 → …

Publication series

NameLoResMT 2024 - 7th Workshop on Technologies for Machine Translation of Low-Resource Languages, Proceedings of the Workshop

Conference

Conference7th Workshop on Technologies for Machine Translation of Low-Resource Languages, LoResMT 2024 at ACL 2024
Country/TerritoryThailand
CityBangkok
Period15/08/24 → …

Fingerprint

Dive into the research topics of 'Irish-based Large Language Model with Extreme Low-Resource Settings in Machine Translation'. Together they form a unique fingerprint.

Cite this