Foundational challenges in assuring alignment and safety of large language models
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …
A systematic survey and critical review on evaluating large language models: Challenges, limitations, and recommendations
Abstract Large Language Models (LLMs) have recently gained significant attention due to
their remarkable capabilities in performing diverse tasks across various domains. However …
Protein language models are biased by unequal sequence sampling across the tree of life
Protein language models (pLMs) trained on large protein sequence databases have been
used to understand disease and design novel proteins. In design tasks, the likelihood of a …
Evaluating language model agency through negotiations
Companies, organizations, and governments increasingly exploit Language Models' (LMs')
remarkable capability to display agent-like behavior. As LMs are adopted to perform tasks …
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data
Reinforcement Learning from Human Feedback (RLHF) has proven effective in aligning
large language models with human intentions, yet it often relies on complex methodologies …
Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks
We examine multi-task benchmarks in machine learning through the lens of social choice
theory. We draw an analogy between benchmarks and electoral systems, where models are …
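The electoral-systems analogy above can be made concrete with a toy aggregation rule. The sketch below applies a Borda count, a classic social-choice method, to per-task model scores; it is purely illustrative of the lens the paper adopts, not the paper's own procedure, and the model names and scores are invented.

```python
def borda_aggregate(task_scores):
    """task_scores: dict task -> dict model -> score (higher is better).
    Returns model names sorted by total Borda points across tasks."""
    points = {}
    for scores in task_scores.values():
        # Rank models within each task: the worst earns 0 points,
        # the best earns n-1, mirroring a Borda ballot.
        ranked = sorted(scores, key=scores.get)
        for pts, model in enumerate(ranked):
            points[model] = points.get(model, 0) + pts
    return sorted(points, key=points.get, reverse=True)

# Hypothetical benchmark: three tasks, three models.
benchmark = {
    "qa":        {"model_a": 0.81, "model_b": 0.74, "model_c": 0.69},
    "summarize": {"model_a": 0.55, "model_b": 0.61, "model_c": 0.58},
    "code":      {"model_a": 0.40, "model_b": 0.52, "model_c": 0.71},
}
print(borda_aggregate(benchmark))  # -> ['model_b', 'model_c', 'model_a']
```

Note how the aggregate winner (model_b) tops no single task: rank aggregation can behave very differently from averaging raw scores, which is exactly the kind of trade-off a social-choice analysis surfaces.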
Automating government report generation: A generative AI approach for efficient data extraction, analysis, and visualization
R Gupta, G Pandey, SK Pal - Digital Government: Research and Practice, 2024 - dl.acm.org
This application paper introduces a transformative solution to address the labour-intensive
manual report generation, data searching & report revision process in government entities …
PairEval: Open-domain Dialogue Evaluation with Pairwise Comparison
Building a reliable and automated evaluation metric is a necessary but challenging problem
for open-domain dialogue systems. Recent studies proposed evaluation metrics that assess …
Compare without Despair: Reliable Preference Evaluation with Generation Separability
Human evaluation of generated language through pairwise preference judgments is
pervasive. However, under common scenarios, such as when generations from a model pair …
Prediction-Powered Ranking of Large Language Models
Large language models are often ranked according to their level of alignment with human
preferences--a model is better than other models if its outputs are more frequently preferred …
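The "more frequently preferred" criterion above corresponds to ranking models by their pairwise win rate. The sketch below shows that baseline computation on invented comparison data; it is a minimal illustration of the ranking criterion, not the prediction-powered method the paper develops.

```python
from collections import defaultdict

def rank_by_win_rate(comparisons):
    """comparisons: list of (winner, loser) model-name pairs from
    pairwise preference judgments. Returns models sorted by
    win rate = wins / total comparisons the model appeared in."""
    wins, games = defaultdict(int), defaultdict(int)
    for winner, loser in comparisons:
        wins[winner] += 1
        games[winner] += 1
        games[loser] += 1
    return sorted(games, key=lambda m: wins[m] / games[m], reverse=True)

# Hypothetical judgments over three models.
data = [("model_x", "model_y"), ("model_x", "model_z"),
        ("model_y", "model_z"), ("model_z", "model_x")]
print(rank_by_win_rate(data))  # -> ['model_x', 'model_y', 'model_z']
```

In practice such judgments are scarce and expensive to collect, which is precisely the gap that motivates augmenting human preferences with model-predicted ones.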