An algorithmic perspective on imitation learning

T Osa, J Pajarinen, G Neumann… - … and Trends® in …, 2018 - nowpublishers.com
As robots and other intelligent agents move from simple environments and problems to more
complex, unstructured settings, manually programming their behavior has become …

A survey of inverse reinforcement learning: Challenges, methods and progress

S Arora, P Doshi - Artificial Intelligence, 2021 - Elsevier
Inverse reinforcement learning (IRL) is the problem of inferring the reward function of an
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …

A survey of inverse reinforcement learning

S Adams, T Cody, PA Beling - Artificial Intelligence Review, 2022 - Springer
Learning from demonstration, or imitation learning, is the process of learning to act in an
environment from examples provided by a teacher. Inverse reinforcement learning (IRL) is a …

Weak human preference supervision for deep reinforcement learning

Z Cao, KC Wong, CT Lin - IEEE Transactions on Neural …, 2021 - ieeexplore.ieee.org
The current reward learning from human preferences could be used to resolve complex
reinforcement learning (RL) tasks without access to a reward function by defining a single …

Efficient exploration of reward functions in inverse reinforcement learning via Bayesian optimization

S Balakrishnan, QP Nguyen… - Advances in Neural …, 2020 - proceedings.neurips.cc
The problem of inverse reinforcement learning (IRL) is relevant to a variety of tasks including
value alignment and robot learning from demonstration. Despite significant algorithmic …

Distributed inverse optimal control

W **, S Mou - Automatica, 2021 - Elsevier
This paper develops a distributed approach for inverse optimal control (IOC) in multi-agent
systems. Here each agent can only communicate with certain nearby neighbors and only …

Inverse learning for data-driven calibration of model-based statistical path planning

M Menner, K Berntorp, MN Zeilinger… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This paper presents a method for inverse learning of a control objective defined in terms of
requirements and their joint probability distribution from data. The probability distribution …

Learning models of sequential decision-making with partial specification of agent behavior

VV Unhelkar, JA Shah - Proceedings of the AAAI conference on artificial …, 2019 - aaai.org
Artificial agents that interact with other (human or artificial) agents require models in order to
reason about those other agents' behavior. In addition to the predictive utility of these …

Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs

D Straub, M Schultheis, H Koeppl… - Advances in Neural …, 2024 - proceedings.neurips.cc
Inverse optimal control can be used to characterize behavior in sequential decision-making
tasks. Most existing work, however, is limited to fully observable or linear systems, or …

Shared control with human trust and workload models

M Cubuktepe, N Jansen… - Cyber–Physical–Human …, 2023 - Wiley Online Library
We synthesize shared control protocols subject to probabilistic temporal logic specifications.
Specifically, we develop a framework in which a human and an autonomy protocol can issue …