A survey on data selection for language models
A major factor in the recent success of large language models is the use of enormous and
ever-growing text datasets for unsupervised pre-training. However, naively training a model …
Foundational challenges in assuring alignment and safety of large language models
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …
How Johnny can persuade LLMs to jailbreak them: Rethinking persuasion to challenge AI safety by humanizing LLMs
Most traditional AI safety research has approached AI models as machines and centered on
algorithm-focused attacks developed by security experts. As large language models (LLMs) …
OLMo: Accelerating the science of language models
Language models (LMs) have become ubiquitous in both NLP research and in commercial
product offerings. As their commercial importance has surged, the most powerful models …
BigCodeBench: Benchmarking code generation with diverse function calls and complex instructions
Task automation has been greatly empowered by the recent advances in Large Language
Models (LLMs) via Python code, where the tasks range from software engineering …
No" zero-shot" without exponential data: Pretraining concept frequency determines multimodal model performance
Web-crawled pretraining datasets underlie the impressive" zero-shot" evaluation
performance of multimodal models, such as CLIP for classification and Stable-Diffusion for …
An archival perspective on pretraining data
Alongside an explosion in research and development related to large language models,
there has been a concomitant rise in the creation of pretraining datasets—massive …
Open problems in technical AI governance
AI progress is creating a growing range of risks and opportunities, but it is often unclear how
they should be navigated. In many cases, the barriers and uncertainties faced are at least …
The bias amplification paradox in text-to-image generation
Bias amplification is a phenomenon in which models increase imbalances present in the
training data. In this paper, we study bias amplification in the text-to-image domain using …
Evaluating copyright takedown methods for language models
Language models (LMs) derive their capabilities from extensive training on diverse data,
including potentially copyrighted material. These models can memorize and generate …