Madlad-400: A multilingual and document-level large audited dataset

S Kudugunta, I Caswell, B Zhang… - Advances in …, 2024 - proceedings.neurips.cc
We introduce MADLAD-400, a manually audited, general domain 3T token monolingual
dataset based on CommonCrawl, spanning 419 languages. We discuss the limitations …

Many-shot in-context learning

R Agarwal, A Singh, LM Zhang, B Bohnet… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) excel at few-shot in-context learning (ICL)--learning from a
few examples provided in context at inference, without any weight updates. Newly expanded …

Aya model: An instruction finetuned open-access multilingual language model

A Üstün, V Aryabumi, ZX Yong, WY Ko… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent breakthroughs in large language models (LLMs) have centered around a handful of
data-rich languages. What does it take to broaden access to breakthroughs beyond first …

Understanding and mitigating language confusion in llms

K Marchisio, WY Ko, A Bérard, T Dehaze… - arxiv preprint arxiv …, 2024 - arxiv.org
We investigate a surprising limitation of LLMs: their inability to consistently generate text in a
user's desired language. We create the Language Confusion Benchmark (LCB) to evaluate …

Multilingual large language model: A survey of resources, taxonomy and frontiers

L Qin, Q Chen, Y Zhou, Z Chen, Y Li, L Liao… - arxiv preprint arxiv …, 2024 - arxiv.org
Multilingual Large Language Models are capable of using powerful Large Language
Models to handle and respond to queries in multiple languages, which achieves remarkable …

Do language models have a common sense regarding time? revisiting temporal commonsense reasoning in the era of large language models

R Jain, D Sojitra, A Acharya, S Saha… - Proceedings of the …, 2023 - aclanthology.org
Temporal reasoning represents a vital component of human communication and
understanding, yet remains an underexplored area within the context of Large Language …

Do large language models speak all languages equally? a comparative study in low-resource settings

MA Hasan, P Tarannum, K Dey, I Razzak… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have garnered significant interest in natural language
processing (NLP), particularly their remarkable performance in various downstream tasks in …

A survey of multilingual large language models

L Qin, Q Chen, Y Zhou, Z Chen, Y Li, L Liao, M Li… - Patterns, 2025 - cell.com
Multilingual large language models (MLLMs) leverage advanced large language models to
process and respond to queries across multiple languages, achieving significant success in …

In-context learning with long-context models: An in-depth exploration

A Bertsch, M Ivgi, U Alon, J Berant, MR Gormley… - arxiv preprint arxiv …, 2024 - arxiv.org
As model context lengths continue to increase, the number of demonstrations that can be
provided in-context approaches the size of entire training datasets. We study the behavior of …

LLMeBench: A flexible framework for accelerating llms benchmarking

F Dalvi, M Hasanain, S Boughorbel, B Mousi… - arxiv preprint arxiv …, 2023 - arxiv.org
The recent development and success of Large Language Models (LLMs) necessitate an
evaluation of their performance across diverse NLP tasks in different languages. Although …