A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

Affordances from human videos as a versatile representation for robotics

S Bahl, R Mendonca, L Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Building a robot that can understand and learn to interact by watching humans has inspired
several vision problems. However, despite some successful results on static datasets, it …

Human-to-robot imitation in the wild

S Bahl, A Gupta, D Pathak - arxiv preprint arxiv:2207.09450, 2022 - arxiv.org
We approach the problem of learning by watching humans in the wild. While traditional
approaches in Imitation and Reinforcement Learning are promising for learning in the real …

CORL: Research-oriented deep offline reinforcement learning library

D Tarasov, A Nikulin, D Akimov… - Advances in …, 2024 - proceedings.neurips.cc
CORL is an open-source library that provides thoroughly benchmarked single-file
implementations of both deep offline and offline-to-online reinforcement learning algorithms …

Differentiable integrated motion prediction and planning with learnable cost function for autonomous driving

Z Huang, H Liu, J Wu, C Lv - IEEE transactions on neural …, 2023 - ieeexplore.ieee.org
Predicting the future states of surrounding traffic participants and planning a safe, smooth,
and socially compliant trajectory accordingly are crucial for autonomous vehicles (AVs) …

Videodex: Learning dexterity from internet videos

K Shaw, S Bahl, D Pathak - Conference on Robot Learning, 2023 - proceedings.mlr.press
To build general robotic agents that can operate in many environments, it is often imperative
for the robot to collect experience in the real world. However, this is often not feasible due to …

Tianshou: A highly modularized deep reinforcement learning library

J Weng, H Chen, D Yan, K You, A Duburcq… - Journal of Machine …, 2022 - jmlr.org
In this paper, we present Tianshou, a highly modularized Python library for deep
reinforcement learning (DRL) that uses PyTorch as its backend. Tianshou intends to be …

You can't count on luck: Why decision transformers and rvs fail in stochastic environments

K Paster, S McIlraith, J Ba - Advances in neural information …, 2022 - proceedings.neurips.cc
Recently, methods such as Decision Transformer that reduce reinforcement learning to a
prediction task and solve it via supervised learning (RvS) have become popular due to their …

[HTML][HTML] Breaking new ground: Opportunities and challenges in tunnel boring machine operations with integrated management systems and artificial intelligence

J Loy-Benitez, MK Song, YH Choi, JK Lee… - Automation in …, 2024 - Elsevier
Advances in tunnel boring machines (TBM) have leveraged applied artificial intelligence to
promote sustainable and automatic tunneling construction. This paper highlights the …

Lapo: Latent-variable advantage-weighted policy optimization for offline reinforcement learning

X Chen, A Ghadirzadeh, T Yu, J Wang… - Advances in …, 2022 - proceedings.neurips.cc
Offline reinforcement learning methods hold the promise of learning policies from pre-
collected datasets without the need to query the environment for new samples. This setting …