Open problems and fundamental limitations of reinforcement learning from human feedback

S Casper, X Davies, C Shi, TK Gilbert… - arXiv preprint arXiv…, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …

Diffusion model alignment using direct preference optimization

B Wallace, M Dang, R Rafailov… - Proceedings of the …, 2024 - openaccess.thecvf.com
Large language models (LLMs) are fine-tuned using human comparison data with
Reinforcement Learning from Human Feedback (RLHF) methods to make them better …
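The direct preference optimization (DPO) named in this entry's title fits the policy directly on preference pairs, with no separately trained reward model; the diffusion variant studied here adapts the same objective to denoising log-likelihoods. A minimal PyTorch sketch of the standard DPO loss, for orientation only; all tensor and function names are illustrative, not from the paper:

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Standard DPO objective (Rafailov et al., 2023).

    Each argument is a 1-D tensor of summed log-probabilities that the
    trainable policy or the frozen reference model assigns to the
    preferred (chosen) or dispreferred (rejected) completion.
    """
    # How much more the policy favors each completion than the reference does.
    chosen_logratios = policy_chosen_logps - ref_chosen_logps
    rejected_logratios = policy_rejected_logps - ref_rejected_logps
    # Widen the margin between chosen and rejected log-ratios;
    # beta sets the implicit KL-regularization strength.
    margin = beta * (chosen_logratios - rejected_logratios)
    return -F.logsigmoid(margin).mean()

# Illustrative call with random stand-in log-probabilities for a batch of 4.
loss = dpo_loss(torch.randn(4), torch.randn(4), torch.randn(4), torch.randn(4))
```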

Generative AI: Here to stay, but for good?

HS Sætra - Technology in Society, 2023 - Elsevier
Generative AI has taken the world by storm, kicked off for real by ChatGPT and quickly
followed by further development and the release of GPT-4 and similar models from OpenAI's …

Using large language models to simulate multiple humans and replicate human subject studies

GV Aher, RI Arriaga, AT Kalai - International Conference on …, 2023 - proceedings.mlr.press
We introduce a new type of test, called a Turing Experiment (TE), for evaluating to what
extent a given language model, such as GPT models, can simulate different aspects of …

Decoding ChatGPT: A taxonomy of existing research, current challenges, and possible future directions

SS Sohail, F Farhat, Y Himeur, M Nadeem… - Journal of King Saud …, 2023 - Elsevier
Chat Generative Pre-trained Transformer (ChatGPT) has gained significant interest
and attention since its launch in November 2022. It has shown impressive performance in …

Towards understanding sycophancy in language models

M Sharma, M Tong, T Korbak, D Duvenaud… - arXiv preprint arXiv…, 2023 - arxiv.org
Human feedback is commonly utilized to finetune AI assistants. But human feedback may
also encourage model responses that match user beliefs over truthful ones, a behaviour …