Madlad-400: A multilingual and document-level large audited dataset
We introduce MADLAD-400, a manually audited, general domain 3T token monolingual
dataset based on CommonCrawl, spanning 419 languages. We discuss the limitations …
dataset based on CommonCrawl, spanning 419 languages. We discuss the limitations …
Many-shot in-context learning
Large language models (LLMs) excel at few-shot in-context learning (ICL)--learning from a
few examples provided in context at inference, without any weight updates. Newly expanded …
few examples provided in context at inference, without any weight updates. Newly expanded …
Aya model: An instruction finetuned open-access multilingual language model
Recent breakthroughs in large language models (LLMs) have centered around a handful of
data-rich languages. What does it take to broaden access to breakthroughs beyond first …
data-rich languages. What does it take to broaden access to breakthroughs beyond first …
Understanding and mitigating language confusion in llms
We investigate a surprising limitation of LLMs: their inability to consistently generate text in a
user's desired language. We create the Language Confusion Benchmark (LCB) to evaluate …
user's desired language. We create the Language Confusion Benchmark (LCB) to evaluate …
Multilingual large language model: A survey of resources, taxonomy and frontiers
Multilingual Large Language Models are capable of using powerful Large Language
Models to handle and respond to queries in multiple languages, which achieves remarkable …
Models to handle and respond to queries in multiple languages, which achieves remarkable …
Do language models have a common sense regarding time? revisiting temporal commonsense reasoning in the era of large language models
Temporal reasoning represents a vital component of human communication and
understanding, yet remains an underexplored area within the context of Large Language …
understanding, yet remains an underexplored area within the context of Large Language …
Do large language models speak all languages equally? a comparative study in low-resource settings
Large language models (LLMs) have garnered significant interest in natural language
processing (NLP), particularly their remarkable performance in various downstream tasks in …
processing (NLP), particularly their remarkable performance in various downstream tasks in …
A survey of multilingual large language models
Multilingual large language models (MLLMs) leverage advanced large language models to
process and respond to queries across multiple languages, achieving significant success in …
process and respond to queries across multiple languages, achieving significant success in …
In-context learning with long-context models: An in-depth exploration
As model context lengths continue to increase, the number of demonstrations that can be
provided in-context approaches the size of entire training datasets. We study the behavior of …
provided in-context approaches the size of entire training datasets. We study the behavior of …
LLMeBench: A flexible framework for accelerating llms benchmarking
The recent development and success of Large Language Models (LLMs) necessitate an
evaluation of their performance across diverse NLP tasks in different languages. Although …
evaluation of their performance across diverse NLP tasks in different languages. Although …