Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

Chen, Pinzhen; Ji, Shaoxiong; Bogoychev, Nikolay; Kutuzov, Andrey; Haddow, Barry; Heafield, Kenneth

arXiv:2309.08958 (cs)

[Submitted on 16 Sep 2023 (v1), last revised 31 Jan 2024 (this version, v2)]

Abstract: Foundational large language models (LLMs) can be instruction-tuned to perform open-domain question answering, facilitating applications like chat assistants. While such efforts are often carried out in a single language, we empirically analyze cost-efficient strategies for multilingual scenarios. Our study employs the Alpaca dataset and machine translations of it to form multilingual data, which is then used to tune LLMs through either low-rank adaptation or full-parameter training. Under a controlled computation budget, comparisons show that multilingual tuning is on par with or better than tuning a model for each language. Furthermore, multilingual tuning with downsampled data can be as powerful and more robust. Our findings serve as a guide for expanding language support through instruction tuning.
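The abstract does not say how the downsampled multilingual data is built, so the following is only a minimal sketch under stated assumptions: each language has a parallel translation of the same instruction set, the budget is a fixed number of training examples, and the budget is split as evenly as possible across languages. The function name `downsampled_multilingual_mix` and the example fields are hypothetical and not taken from the paper.

```python
import random


def downsampled_multilingual_mix(data_by_lang, budget, seed=0):
    """Build a multilingual training mix of exactly `budget` examples.

    Assumptions (not from the paper):
      - data_by_lang maps a language code to a list of example dicts;
      - the lists are parallel, i.e. index i is the same instruction
        in every language;
      - each selected instruction is used in exactly one language.
    """
    langs = sorted(data_by_lang)
    n = len(data_by_lang[langs[0]])
    if any(len(data_by_lang[lang]) != n for lang in langs):
        raise ValueError("all languages must have the same number of examples")
    if budget > n:
        raise ValueError("budget cannot exceed the size of one language's data")

    rng = random.Random(seed)
    # Pick `budget` distinct instruction indices without replacement.
    chosen = rng.sample(range(n), budget)

    mix = []
    for position, idx in enumerate(chosen):
        # Round-robin assignment keeps the per-language counts balanced.
        lang = langs[position % len(langs)]
        mix.append({"lang": lang, **data_by_lang[lang][idx]})

    # Shuffle so languages are interleaved during training.
    rng.shuffle(mix)
    return mix
```

With this kind of mix, the multilingual run sees the same number of examples as a single monolingual run, which is one simple way to hold the data budget fixed when comparing the two settings.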

Comments:	Accepted to Findings of ACL: EACL 2024. Added human evaluation and shortened writing
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2309.08958 [cs.CL]
	(or arXiv:2309.08958v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.08958

Submission history

From: Pinzhen Chen
[v1] Sat, 16 Sep 2023 11:22:46 UTC (46 KB)
[v2] Wed, 31 Jan 2024 03:42:04 UTC (39 KB)
