Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, so do risks from misalignment. To provide a comprehensive …

How to train your robot with deep reinforcement learning: lessons we have learned

J Ibarz, J Tan, C Finn, M Kalakrishnan… - … Journal of Robotics …, 2021 - journals.sagepub.com
Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arxiv preprint arxiv:2005.01643, 2020 - arxiv.org
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

[KIRJA][B] The alignment problem: How can machines learn human values?

B Christian - 2021 - books.google.com
'Vital reading. This is the book on artificial intelligence we need right now.'Mike Krieger,
cofounder of Instagram Artificial intelligence is rapidly dominating every aspect of our …

Maximum entropy RL (provably) solves some robust RL problems

B Eysenbach, S Levine - arxiv preprint arxiv:2103.06257, 2021 - arxiv.org
Many potential applications of reinforcement learning (RL) require guarantees that the agent
will perform well in the face of disturbances to the dynamics or reward function. In this paper …

Learning to walk via deep reinforcement learning

T Haarnoja, S Ha, A Zhou, J Tan, G Tucker… - arxiv preprint arxiv …, 2018 - arxiv.org
Deep reinforcement learning (deep RL) holds the promise of automating the acquisition of
complex controllers that can map sensory inputs directly to low-level actions. In the domain …

Recovery rl: Safe reinforcement learning with learned recovery zones

B Thananjeyan, A Balakrishna, S Nair… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
Safety remains a central obstacle preventing widespread use of RL in the real world:
learning new tasks in uncertain environments requires extensive exploration, but safety …

Learning to walk in the real world with minimal human effort

S Ha, P Xu, Z Tan, S Levine, J Tan - arxiv preprint arxiv:2002.08550, 2020 - arxiv.org
Reliable and stable locomotion has been one of the most fundamental challenges for
legged robots. Deep reinforcement learning (deep RL) has emerged as a promising method …

Learning to be safe: Deep rl with a safety critic

K Srinivasan, B Eysenbach, S Ha, J Tan… - arxiv preprint arxiv …, 2020 - arxiv.org
Safety is an essential component for deploying reinforcement learning (RL) algorithms in
real-world scenarios, and is critical during the learning process itself. A natural first approach …