A brief overview of ChatGPT: The history, status quo and potential future development

T Wu, S He, J Liu, S Sun, K Liu… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
ChatGPT, an artificial intelligence generated content (AIGC) model developed by OpenAI,
has attracted world-wide attention for its capability of dealing with challenging language …

[HTML][HTML] Deep learning, reinforcement learning, and world models

Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver… - Neural Networks, 2022 - Elsevier
Deep learning (DL) and reinforcement learning (RL) methods seem to be a part of
indispensable factors to achieve human-level or super-human AI systems. On the other …

Outracing champion Gran Turismo drivers with deep reinforcement learning

PR Wurman, S Barrett, K Kawamoto, J MacGlashan… - Nature, 2022 - nature.com
Many potential applications of artificial intelligence involve making real-time decisions in
physical systems while interacting with humans. Automobile racing represents an extreme …

Daydreamer: World models for physical robot learning

P Wu, A Escontrela, D Hafner… - … on robot learning, 2023 - proceedings.mlr.press
To solve tasks in complex environments, robots need to learn from experience. Deep
reinforcement learning is a common approach to robot learning but requires a large amount …

Social physics

M Jusup, P Holme, K Kanazawa, M Takayasu, I Romić… - Physics Reports, 2022 - Elsevier
Recent decades have seen a rise in the use of physics methods to study different societal
phenomena. This development has been due to physicists venturing outside of their …

Efficient online reinforcement learning with offline data

PJ Ball, L Smith, I Kostrikov… - … Conference on Machine …, 2023 - proceedings.mlr.press
Sample efficiency and exploration remain major challenges in online reinforcement learning
(RL). A powerful approach that can be applied to address these issues is the inclusion of …

Reinforcement learning algorithms: A brief survey

AK Shakya, G Pillai, S Chakrabarty - Expert Systems with Applications, 2023 - Elsevier
Reinforcement Learning (RL) is a machine learning (ML) technique to learn sequential
decision-making in complex problems. RL is inspired by trial-and-error based human/animal …

Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning

M Nakamoto, S Zhai, A Singh… - Advances in …, 2024 - proceedings.neurips.cc
A compelling use case of offline reinforcement learning (RL) is to obtain a policy initialization
from existing datasets followed by fast online fine-tuning with limited interaction. However …

The primacy bias in deep reinforcement learning

E Nikishin, M Schwarzer, P D'Oro… - International …, 2022 - proceedings.mlr.press
This work identifies a common flaw of deep reinforcement learning (RL) algorithms: a
tendency to rely on early interactions and ignore useful evidence encountered later …

How to train your robot with deep reinforcement learning: lessons we have learned

J Ibarz, J Tan, C Finn, M Kalakrishnan… - … Journal of Robotics …, 2021 - journals.sagepub.com
Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …