- Academic Search

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, so do risks from misalignment. To provide a comprehensive …

Tallenna Viittaa Viittausten määrä 243 Aiheeseen liittyviä artikkeleita Kaikki 4 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] sagepub.com

How to train your robot with deep reinforcement learning: lessons we have learned

J Ibarz, J Tan, C Finn, M Kalakrishnan… - … Journal of Robotics …, 2021 - journals.sagepub.com

Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …

Tallenna Viittaa Viittausten määrä 698 Aiheeseen liittyviä artikkeleita Kaikki 8 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org

AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Tallenna Viittaa Viittausten määrä 4771 Aiheeseen liittyviä artikkeleita Kaikki 2 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arxiv preprint arxiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Tallenna Viittaa Viittausten määrä 2198 Aiheeseen liittyviä artikkeleita Kaikki 3 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] unicenter.pt

[KIRJA][B] The alignment problem: How can machines learn human values?

B Christian - 2021 - books.google.com

'Vital reading. This is the book on artificial intelligence we need right now.'Mike Krieger,
cofounder of Instagram Artificial intelligence is rapidly dominating every aspect of our …

Tallenna Viittaa Viittausten määrä 639 Aiheeseen liittyviä artikkeleita Kaikki 6 versiota Kirjastohaku

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Maximum entropy RL (provably) solves some robust RL problems

B Eysenbach, S Levine - arxiv preprint arxiv:2103.06257, 2021 - arxiv.org

Many potential applications of reinforcement learning (RL) require guarantees that the agent
will perform well in the face of disturbances to the dynamics or reward function. In this paper …

Tallenna Viittaa Viittausten määrä 211 Aiheeseen liittyviä artikkeleita Kaikki 4 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning to walk via deep reinforcement learning

T Haarnoja, S Ha, A Zhou, J Tan, G Tucker… - arxiv preprint arxiv …, 2018 - arxiv.org

Deep reinforcement learning (deep RL) holds the promise of automating the acquisition of
complex controllers that can map sensory inputs directly to low-level actions. In the domain …

Tallenna Viittaa Viittausten määrä 585 Aiheeseen liittyviä artikkeleita Kaikki 6 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Recovery rl: Safe reinforcement learning with learned recovery zones

B Thananjeyan, A Balakrishna, S Nair… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

Safety remains a central obstacle preventing widespread use of RL in the real world:
learning new tasks in uncertain environments requires extensive exploration, but safety …

Tallenna Viittaa Viittausten määrä 262 Aiheeseen liittyviä artikkeleita Kaikki 7 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning to walk in the real world with minimal human effort

S Ha, P Xu, Z Tan, S Levine, J Tan - arxiv preprint arxiv:2002.08550, 2020 - arxiv.org

Reliable and stable locomotion has been one of the most fundamental challenges for
legged robots. Deep reinforcement learning (deep RL) has emerged as a promising method …

Tallenna Viittaa Viittausten määrä 194 Aiheeseen liittyviä artikkeleita Kaikki 5 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning to be safe: Deep rl with a safety critic

K Srinivasan, B Eysenbach, S Ha, J Tan… - arxiv preprint arxiv …, 2020 - arxiv.org

Safety is an essential component for deploying reinforcement learning (RL) algorithms in
real-world scenarios, and is critical during the learning process itself. A natural first approach …

Tallenna Viittaa Viittausten määrä 174 Aiheeseen liittyviä artikkeleita Kaikki 2 versiota HTML-versio

Luo ilmoitus

Viittaa

Tarkennettu haku

Tallennettu omaan kirjastoon

Leave no trace: Learning to reset for safe and autonomous reinforcement learning

Ai alignment: A comprehensive survey

How to train your robot with deep reinforcement learning: lessons we have learned

On the opportunities and risks of foundation models

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

[KIRJA][B] The alignment problem: How can machines learn human values?

Maximum entropy RL (provably) solves some robust RL problems

Learning to walk via deep reinforcement learning

Recovery rl: Safe reinforcement learning with learned recovery zones

Learning to walk in the real world with minimal human effort

Learning to be safe: Deep rl with a safety critic