A comprehensive overview of large language models
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …
natural language processing tasks and beyond. This success of LLMs has led to a large …
Crosslingual generalization through multitask finetuning
Multitask prompted finetuning (MTF) has been shown to help large language models
generalize to new tasks in a zero-shot setting, but so far explorations of MTF have focused …
generalize to new tasks in a zero-shot setting, but so far explorations of MTF have focused …
Datasets for large language models: A comprehensive survey
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …
Visit-bench: A benchmark for vision-language instruction following inspired by real-world use
We introduce VisIT-Bench (Visual InsTruction Benchmark), a benchmark for evaluation of
instruction-following vision-language models for real-world use. Our starting point is curating …
instruction-following vision-language models for real-world use. Our starting point is curating …
Datacomp-lm: In search of the next generation of training sets for language models
We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset
experiments with the goal of improving language models. As part of DCLM, we provide a …
experiments with the goal of improving language models. As part of DCLM, we provide a …
Data management for large language models: A survey
Data plays a fundamental role in the training of Large Language Models (LLMs). Effective
data management, particularly in the formulation of a well-suited training dataset, holds …
data management, particularly in the formulation of a well-suited training dataset, holds …
The shifted and the overlooked: A task-oriented investigation of user-GPT interactions
Recent progress in Large Language Models (LLMs) has produced models that exhibit
remarkable performance across a variety of NLP tasks. However, it remains unclear whether …
remarkable performance across a variety of NLP tasks. However, it remains unclear whether …
Active instruction tuning: Improving cross-task generalization by training on prompt sensitive tasks
Instruction tuning (IT) achieves impressive zero-shot generalization results by training large
language models (LLMs) on a massive amount of diverse tasks with instructions. However …
language models (LLMs) on a massive amount of diverse tasks with instructions. However …
Suri: Multi-constraint instruction following for long-form text generation
Existing research on instruction following largely focuses on tasks with simple instructions
and short responses. In this work, we explore multi-constraint instruction following for …
and short responses. In this work, we explore multi-constraint instruction following for …
Muffin: Curating multi-faceted instructions for improving instruction following
In the realm of large language models (LLMs), enhancing instruction-following capability
often involves curating expansive training data. This is achieved through two primary …
often involves curating expansive training data. This is achieved through two primary …