Learning from all vehicles

D Chen, P Krähenbühl - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
In this paper, we present a system to train driving policies from experiences collected not just
from the ego-vehicle, but all vehicles that it observes. This system uses the behaviors of …

Predictive representations: Building blocks of intelligence

W Carvalho, MS Tomov, W de Cothi, C Barry… - Neural …, 2024 - direct.mit.edu
Adaptive behavior often requires predicting future events. The theory of reinforcement
learning prescribes what kinds of predictive representations are useful and how to compute …

Trihelper: Zero-shot object navigation with dynamic assistance

L Zhang, Q Zhang, H Wang, E **ao… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org
Navigating toward specific objects in unknown environments without additional training,
known as Zero-Shot object navigation, poses a significant challenge in the field of robotics …

Combining behaviors with the successor features keyboard

WC Carvalho, A Saraiva, A Filos… - Advances in neural …, 2023 - proceedings.neurips.cc
Abstract The Option Keyboard (OK) was recently proposed as a method for transferring
behavioral knowledge across tasks. OK transfers knowledge by adaptively combining …

Cacto: Continuous actor-critic with trajectory optimization—towards global optimality

G Grandesso, E Alboni, GPR Papini… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
This letter presents a novel algorithm for the continuous control of dynamical systems that
combines Trajectory Optimization (TO) and Reinforcement Learning (RL) in a single …

Self-supervised reinforcement learning that transfers using random features

B Chen, C Zhu, P Agrawal… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Model-free reinforcement learning algorithms have exhibited great potential in
solving single-task sequential decision-making problems with high-dimensional …

Spatial-temporal causality modeling for industrial processes with a knowledge-data guided reinforcement learning

X Zhang, C Song, J Zhao, Z Xu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Causality in an industrial process provides insights into how various process variables
interact and affect each other within the system. It reveals the underlying mechanisms of …

Composing task knowledge with modular successor feature approximators

W Carvalho, A Filos, RL Lewis, S Singh - arxiv preprint arxiv …, 2023 - arxiv.org
Recently, the Successor Features and Generalized Policy Improvement (SF&GPI) framework
has been proposed as a method for learning, composing, and transferring predictive …

Contrastive value learning: Implicit models for simple offline rl

B Mazoure, B Eysenbach, O Nachum… - … on Robot Learning, 2023 - proceedings.mlr.press
Abstract Model-based reinforcement learning (RL) methods are appealing in the offline
setting because they allow an agent to reason about the consequences of actions without …

Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review

H Hassani, R Razavi-Far, M Saif, L Lin - arxiv preprint arxiv:2411.10268, 2024 - arxiv.org
Reinforcement learning (RL) is a sub-domain of machine learning, mainly concerned with
solving sequential decision-making problems by a learning agent that interacts with the …