Human-robot teaming: grand challenges

M Natarajan, E Seraj, B Altundas, R Paleja, S Ye… - Current Robotics …, 2023 - Springer
Abstract Purpose of Review Current real-world interaction between humans and robots is
extremely limited. We present challenges that, if addressed, will enable humans and robots …

Mimicgen: A data generation system for scalable robot learning using human demonstrations

A Mandlekar, S Nasiriany, B Wen, I Akinola… - arxiv preprint arxiv …, 2023 - arxiv.org
Imitation learning from a large set of human demonstrations has proved to be an effective
paradigm for building capable robot agents. However, the demonstrations can be extremely …

Few-shot preference learning for human-in-the-loop rl

DJ Hejna III, D Sadigh - Conference on Robot Learning, 2023 - proceedings.mlr.press
While reinforcement learning (RL) has become a more popular approach for robotics,
designing sufficiently informative reward functions for complex tasks has proven to be …

Data quality in imitation learning

S Belkhale, Y Cui, D Sadigh - Advances in neural …, 2023 - proceedings.neurips.cc
In supervised learning, the question of data quality and curation has been sidelined in
recent years in favor of increasingly more powerful and expressive models that can ingest …

i-sim2real: Reinforcement learning of robotic policies in tight human-robot interaction loops

SW Abeyruwan, L Graesser… - … on Robot Learning, 2023 - proceedings.mlr.press
Sim-to-real transfer is a powerful paradigm for robotic reinforcement learning. The ability to
train policies in simulation enables safe exploration and large-scale data collection quickly …

Imitation learning by estimating expertise of demonstrators

M Beliaev, A Shih, S Ermon, D Sadigh… - International …, 2022 - proceedings.mlr.press
Many existing imitation learning datasets are collected from multiple demonstrators, each
with different expertise at different parts of the environment. Yet, standard imitation learning …

Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences

E Bıyık, DP Losey, M Palan… - … Journal of Robotics …, 2022 - journals.sagepub.com
Reward functions are a common way to specify the objective of a robot. As designing reward
functions can be extremely challenging, a more promising approach is to directly learn …

Confidence-aware imitation learning from demonstrations with varying optimality

S Zhang, Z Cao, D Sadigh… - Advances in Neural …, 2021 - proceedings.neurips.cc
Most existing imitation learning approaches assume the demonstrations are drawn from
experts who are optimal, but relaxing this assumption enables us to use a wider range of …

Efficient preference-based reinforcement learning using learned dynamics models

Y Liu, G Datta, E Novoseller… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Preference-based reinforcement learning (PbRL) can enable robots to learn to perform tasks
based on an individual's preferences without requiring a hand-crafted re-ward function …

Robotic table tennis: A case study into a high speed learning system

DB D'Ambrosio, J Abelian, S Abeyruwan, M Ahn… - arxiv preprint arxiv …, 2023 - arxiv.org
We present a deep-dive into a real-world robotic learning system that, in previous work, was
shown to be capable of hundreds of table tennis rallies with a human and has the ability to …