Μελετητής Google

B **, G Liu, C Han, M Jiang, H Ji… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Large language models (LLMs), such as GPT4 and LLaMA, are creating significant
advancements in natural language processing, due to their strong text encoding/decoding …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 131 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 226 Σχετικά άρθρα Όλες οι 3 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

Open problems and fundamental limitations of reinforcement learning from human feedback

S Casper, X Davies, C Shi, TK Gilbert… - arxiv preprint arxiv …, 2023 - arxiv.org

Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 439 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] mlr.press

Do the rewards justify the means? measuring trade-offs between rewards and ethical behavior in the machiavelli benchmark

A Pan, JS Chan, A Zou, N Li, S Basart… - International …, 2023 - proceedings.mlr.press

Artificial agents have traditionally been trained to maximize reward, which may incentivize
power-seeking and deception, analogous to how next-token prediction in language models …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 132 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

The curse of recursion: Training on generated data makes models forget

I Shumailov, Z Shumaylov, Y Zhao, Y Gal… - arxiv preprint arxiv …, 2023 - arxiv.org

Stable Diffusion revolutionised image creation from descriptive text. GPT-2, GPT-3 (. 5) and
GPT-4 demonstrated astonishing performance across a variety of language tasks. ChatGPT …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 286 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

Efficient large language models: A survey

Z Wan, X Wang, C Liu, S Alam, Y Zheng, J Liu… - arxiv preprint arxiv …, 2023 - arxiv.org

Large Language Models (LLMs) have demonstrated remarkable capabilities in important
tasks such as natural language understanding and language generation, and thus have the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 126 Σχετικά άρθρα Όλες οι 7 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] jmlr.org

Curriculum learning for reinforcement learning domains: A framework and survey

S Narvekar, B Peng, M Leonetti, J Sinapov… - Journal of Machine …, 2020 - jmlr.org

Reinforcement learning (RL) is a popular paradigm for addressing sequential decision tasks
in which the agent has only limited environmental feedback. Despite many advances over …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 613 Σχετικά άρθρα Όλες οι 11 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

Reinforcement learning in healthcare: A survey

C Yu, J Liu, S Nemati, G Yin - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 783 Σχετικά άρθρα Όλες οι 5 εκδοχές

[Free GPT-4]

[PDF] springer.com

Generative artificial intelligence

L Banh, G Strobel - Electronic Markets, 2023 - Springer

Recent developments in the field of artificial intelligence (AI) have enabled new paradigms
of machine processing, shifting from data-driven, discriminative AI tasks toward …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 191 Σχετικά άρθρα Όλες οι 5 εκδοχές

[Free GPT-4]

[PDF] nsf.gov

In situ bidirectional human-robot value alignment

L Yuan, X Gao, Z Zheng, M Edmonds, YN Wu… - Science robotics, 2022 - science.org

A prerequisite for social coordination is bidirectional communication between teammates,
each playing two roles simultaneously: as receptive listeners and expressive speakers. For …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 91 Σχετικά άρθρα Όλες οι 6 εκδοχές

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Policy sha**: Integrating human feedback with reinforcement learning

Large language models on graphs: A comprehensive survey

Ai alignment: A comprehensive survey

Open problems and fundamental limitations of reinforcement learning from human feedback

Do the rewards justify the means? measuring trade-offs between rewards and ethical behavior in the machiavelli benchmark

The curse of recursion: Training on generated data makes models forget

Efficient large language models: A survey

Curriculum learning for reinforcement learning domains: A framework and survey

Reinforcement learning in healthcare: A survey

Generative artificial intelligence

In situ bidirectional human-robot value alignment