Informed machine learning–a taxonomy and survey of integrating prior knowledge into learning systems

L Von Rueden, S Mayer, K Beckh… - … on Knowledge and …, 2021 - ieeexplore.ieee.org
Despite its great success, machine learning can have its limits when dealing with insufficient
training data. A potential solution is the additional integration of prior knowledge into the …

Scalable agent alignment via reward modeling: a research direction

J Leike, D Krueger, T Everitt, M Martic, V Maini… - arXiv preprint arXiv …, 2018 - arxiv.org
One obstacle to applying reinforcement learning algorithms to real-world problems is the
lack of suitable reward functions. Designing such reward functions is difficult in part because …
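
A common concrete form of the reward modeling discussed in this line of work is a pairwise comparison loss fit to human judgments. The sketch below is illustrative only (reward_model, preferred, and rejected are placeholder names, not from the paper), assuming a Bradley-Terry style objective in PyTorch:

    import torch.nn.functional as F

    def pairwise_reward_loss(reward_model, preferred, rejected):
        # reward_model maps a batch of trajectories/responses to scalar scores
        r_pref = reward_model(preferred)   # shape: (batch,)
        r_rej = reward_model(rejected)     # shape: (batch,)
        # Bradley-Terry objective: maximize the log-probability that the
        # human-preferred item receives the higher reward
        return -F.logsigmoid(r_pref - r_rej).mean()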

Direct preference optimization: Your language model is secretly a reward model

R Rafailov, A Sharma, E Mitchell… - Advances in …, 2023 - proceedings.neurips.cc
While large-scale unsupervised language models (LMs) learn broad world knowledge and
some reasoning skills, achieving precise control of their behavior is difficult due to the …
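
The core of DPO is a classification loss over preference pairs in which the implicit reward is the scaled log-ratio between the policy and a frozen reference model. A minimal PyTorch-style sketch of that objective (argument names and the beta value are illustrative, not the paper's code):

    import torch.nn.functional as F

    def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
        # Sequence log-probabilities under the trained policy and the frozen reference
        chosen_ratio = logp_chosen - ref_logp_chosen
        rejected_ratio = logp_rejected - ref_logp_rejected
        # Classify which response was preferred using the implicit reward beta * log-ratio
        return -F.logsigmoid(beta * (chosen_ratio - rejected_ratio)).mean()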

Statistical rejection sampling improves preference optimization

T Liu, Y Zhao, R Joshi, M Khalman, M Saleh… - arXiv preprint arXiv …, 2023 - arxiv.org
Improving the alignment of language models with human preferences remains an active
research challenge. Previous approaches have primarily utilized Reinforcement Learning …
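
One way to read the title: candidate responses sampled from a supervised policy can be filtered by rejection sampling so that the accepted set approximates draws from a reward-tilted target policy, and those samples then serve as preference data. The sketch below is a generic rejection-sampling filter under that reading, not the paper's exact procedure; beta and num_accept are illustrative:

    import math, random

    def rejection_sample(candidates, rewards, beta=0.5, num_accept=8):
        # Accept a candidate with probability exp((r - r_max) / beta), i.e.
        # proportionally to exp(r / beta), approximating draws from a policy
        # tilted toward higher reward
        r_max = max(rewards)
        accepted = []
        for y, r in zip(candidates, rewards):
            if random.random() < math.exp((r - r_max) / beta):
                accepted.append(y)
            if len(accepted) >= num_accept:
                break
        return accepted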

Fine-tuning language models from human preferences

DM Ziegler, N Stiennon, J Wu, TB Brown… - arXiv preprint arXiv …, 2019 - arxiv.org
Reward learning enables the application of reinforcement learning (RL) to tasks where
reward is defined by human judgment, building a model of reward by asking humans …
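
In this line of work the RL fine-tuning stage typically optimizes the learned reward minus a KL penalty toward the pretrained model, so the policy does not drift into exploiting the reward model. A hedged sketch of that shaped reward (the kl_coef value and argument names are illustrative):

    def kl_shaped_reward(task_reward, policy_logp, ref_logp, kl_coef=0.02):
        # Reward handed to the RL optimizer: learned reward minus a penalty on
        # the per-sample log-ratio, which keeps the tuned policy close to the
        # pretrained reference model
        return task_reward - kl_coef * (policy_logp - ref_logp)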

KTO: Model alignment as prospect theoretic optimization

K Ethayarajh, W Xu, N Muennighoff, D Jurafsky… - arXiv preprint arXiv …, 2024 - arxiv.org
Kahneman & Tversky's $\textit {prospect theory} $ tells us that humans perceive random
variables in a biased but well-defined manner; for example, humans are famously loss …
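
KTO replaces paired comparisons with per-example desirable/undesirable labels and a prospect-theory-style value function measured against a reference point. The sketch below is only an approximation of that idea in PyTorch; the exact reference-point estimate and loss-aversion weights should be taken from the paper, and all names here are placeholders:

    import torch

    def kto_style_loss(policy_logp, ref_logp, is_desirable, beta=0.1,
                       lambda_d=1.0, lambda_u=1.0):
        # Implicit reward, as in DPO-style methods
        r = beta * (policy_logp - ref_logp)
        # Reference point: a batch-level estimate of the policy/reference divergence
        z0 = beta * torch.clamp((policy_logp - ref_logp).mean(), min=0.0).detach()
        # Gains and losses are valued asymmetrically relative to the reference point
        loss_desirable = lambda_d * torch.sigmoid(z0 - r)
        loss_undesirable = lambda_u * torch.sigmoid(r - z0)
        # is_desirable is a boolean tensor marking which examples are labeled desirable
        return torch.where(is_desirable, loss_desirable, loss_undesirable).mean()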

Deep reinforcement learning for sequence-to-sequence models

Y Keneshloo, T Shi, N Ramakrishnan… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
In recent times, sequence-to-sequence (seq2seq) models have gained a lot of popularity
and provide state-of-the-art performance in a wide variety of tasks, such as machine …
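
A representative training objective in this survey area is REINFORCE with a baseline (e.g. self-critical sequence training), where a sequence-level reward replaces token-level likelihood. A minimal sketch under that assumption, with placeholder tensor arguments, and not tied to any one model in the survey:

    def self_critical_loss(sample_logp, sample_reward, baseline_reward):
        # REINFORCE with a baseline: sampled sequences that beat the baseline
        # (e.g. the reward of the greedy-decoded sequence) get their
        # log-probability pushed up, those that fall below get pushed down
        advantage = (sample_reward - baseline_reward).detach()
        return -(advantage * sample_logp).mean()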

Better rewards yield better summaries: Learning to summarise without references

F Böhm, Y Gao, CM Meyer, O Shapira, I Dagan… - arXiv preprint arXiv …, 2019 - arxiv.org
Reinforcement Learning (RL) based document summarisation systems yield state-of-the-art
performance in terms of ROUGE scores, because they directly use ROUGE as the rewards …
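
The entry contrasts ROUGE-as-reward with rewards learned from human judgments of summary quality. As a purely illustrative sketch (regression on human ratings is one option; the paper's actual reward-learning objective may differ), such a reward could be fit like this:

    import torch.nn.functional as F

    def learned_summary_reward_loss(reward_model, summaries, human_ratings):
        # Fit a scalar reward to human quality ratings so that RL training can
        # optimize it instead of ROUGE overlap with reference summaries
        predicted = reward_model(summaries)   # shape: (batch,)
        return F.mse_loss(predicted, human_ratings)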

Smaug: Fixing failure modes of preference optimisation with DPO-Positive

A Pal, D Karkhanis, S Dooley, M Roberts… - arXiv preprint arXiv …, 2024 - arxiv.org
Direct Preference Optimisation (DPO) is effective at significantly improving the performance
of large language models (LLMs) on downstream tasks such as reasoning, summarisation …
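
The failure mode targeted here is that standard DPO can lower the likelihood of the preferred completion as long as the rejected one drops faster. A hedged sketch of a DPO-Positive-style correction, adding a penalty that fires whenever the chosen response becomes less likely than under the reference model (argument names and the lam value are illustrative, not the paper's code):

    import torch
    import torch.nn.functional as F

    def dpop_style_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected,
                        beta=0.1, lam=50.0):
        chosen_ratio = logp_chosen - ref_logp_chosen
        rejected_ratio = logp_rejected - ref_logp_rejected
        # Penalty is positive only when the policy assigns the chosen response
        # less probability than the reference model does
        penalty = torch.clamp(ref_logp_chosen - logp_chosen, min=0.0)
        return -F.logsigmoid(beta * (chosen_ratio - rejected_ratio - lam * penalty)).mean()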

Aligning language models with human preferences via a Bayesian approach

J Wang, H Wang, S Sun, W Li - Advances in Neural …, 2024 - proceedings.neurips.cc
In the quest to advance human-centric natural language generation (NLG) systems,
ensuring alignment between NLG models and human preferences is crucial. For this …