Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text

S Gehrmann, E Clark, T Sellam - Journal of Artificial Intelligence Research, 2023 - jair.org
Evaluation practices in natural language generation (NLG) have many known flaws,
but improved evaluation approaches are rarely widely adopted. This issue has become …

Open problems and fundamental limitations of reinforcement learning from human feedback

S Casper, X Davies, C Shi, TK Gilbert… - arXiv preprint arXiv …, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …

A holistic approach to undesired content detection in the real world

T Markov, C Zhang, S Agarwal, FE Nekoul… - Proceedings of the …, 2023 - ojs.aaai.org
We present a holistic approach to building a robust and useful natural language
classification system for real-world content moderation. The success of such a system relies …

The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation

B Plank - arXiv preprint arXiv:2211.02570, 2022 - arxiv.org
Human variation in labeling is often considered noise. Annotation projects for machine
learning (ML) aim at minimizing human label variation, with the assumption to maximize …

Dealing with disagreements: Looking beyond the majority vote in subjective annotations

AM Davani, M Díaz, V Prabhakaran - Transactions of the Association …, 2022 - direct.mit.edu
Majority voting and averaging are common approaches used to resolve annotator
disagreements and derive single ground truth labels from multiple annotations. However …

Evaluating the social impact of generative AI systems in systems and society

I Solaiman, Z Talat, W Agnew, L Ahmad… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI systems across modalities, ranging from text (including code), image, audio,
and video, have broad social impacts, but there is no official standard for means of …

Jury learning: Integrating dissenting voices into machine learning models

ML Gordon, MS Lam, JS Park, K Patel… - Proceedings of the …, 2022 - dl.acm.org
Whose labels should a machine learning (ML) algorithm learn to emulate? For ML tasks
ranging from online comment toxicity to misinformation detection to medical diagnosis …

Bridging the gap: A survey on integrating (human) feedback for natural language generation

P Fernandes, A Madaan, E Liu, A Farinhas… - Transactions of the …, 2023 - direct.mit.edu
Natural language generation has witnessed significant advancements due to the training of
large language models on vast internet-scale datasets. Despite these advancements, there …

The prism alignment project: What participatory, representative and individualised human feedback reveals about the subjective and multicultural alignment of large …

HR Kirk, A Whitefield, P Röttger, A Bean… - arXiv preprint arXiv …, 2024 - arxiv.org
Human feedback plays a central role in the alignment of Large Language Models (LLMs).
However, open questions remain about the methods (how), domains (where), people (who) …