A survey on evaluation of large language models
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …
industry, owing to their unprecedented performance in various applications. As LLMs …
Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects
Within the vast expanse of computerized language processing, a revolutionary entity known
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …
A survey of large language models
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …
[PDF][PDF] Agentverse: Facilitating multi-agent collaboration and exploring emergent behaviors in agents
Autonomous agents empowered by Large Language Models (LLMs) have undergone
significant improvements, enabling them to generalize across a broad spectrum of tasks …
significant improvements, enabling them to generalize across a broad spectrum of tasks …
Evaluating large language models at evaluating instruction following
As research in large language models (LLMs) continues to accelerate, LLM-based
evaluation has emerged as a scalable and cost-effective alternative to human evaluations …
evaluation has emerged as a scalable and cost-effective alternative to human evaluations …
Generative judge for evaluating alignment
The rapid development of Large Language Models (LLMs) has substantially expanded the
range of tasks they can address. In the field of Natural Language Processing (NLP) …
range of tasks they can address. In the field of Natural Language Processing (NLP) …
Agentverse: Facilitating multi-agent collaboration and exploring emergent behaviors
Autonomous agents empowered by Large Language Models (LLMs) have undergone
significant improvements, enabling them to generalize across a broad spectrum of tasks …
significant improvements, enabling them to generalize across a broad spectrum of tasks …
Leave no document behind: Benchmarking long-context llms with extended multi-doc qa
Long-context modeling capabilities of Large Language Models (LLMs) have garnered
widespread attention, leading to the emergence of LLMs with ultra-context windows …
widespread attention, leading to the emergence of LLMs with ultra-context windows …
Branch-solve-merge improves large language model evaluation and generation
Large Language Models (LLMs) are frequently used for multi-faceted language generation
and evaluation tasks that involve satisfying intricate user constraints or taking into account …
and evaluation tasks that involve satisfying intricate user constraints or taking into account …
Optimization-based prompt injection attack to llm-as-a-judge
LLM-as-a-Judge uses a large language model (LLM) to select the best response from a set
of candidates for a given question. LLM-as-a-Judge has many applications such as LLM …
of candidates for a given question. LLM-as-a-Judge has many applications such as LLM …