- Academic Search

T Osa, J Pajarinen, G Neumann… - … and Trends® in …, 2018 - nowpublishers.com

As robots and other intelligent agents move from simple environments and problems to more
complex, unstructured settings, manually programming their behavior has become …

Save Cite Cited by 1003 Related articles All 14 versions Free GPT-4 DeepSeek Library Search Cached

[Free GPT-4]
[DeepSeek]

[PDF] sciencedirect.com

A survey of inverse reinforcement learning: Challenges, methods and progress

S Arora, P Doshi - Artificial Intelligence, 2021 - Elsevier

Inverse reinforcement learning (IRL) is the problem of inferring the reward function of an
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …

Save Cite Cited by 820 Related articles All 6 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

A survey of inverse reinforcement learning

S Adams, T Cody, PA Beling - Artificial Intelligence Review, 2022 - Springer

Learning from demonstration, or imitation learning, is the process of learning to act in an
environment from examples provided by a teacher. Inverse reinforcement learning (IRL) is a …

Save Cite Cited by 123 Related articles All 8 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Weak human preference supervision for deep reinforcement learning

Z Cao, KC Wong, CT Lin - IEEE Transactions on Neural …, 2021 - ieeexplore.ieee.org

The current reward learning from human preferences could be used to resolve complex
reinforcement learning (RL) tasks without access to a reward function by defining a single …

Save Cite Cited by 49 Related articles All 8 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Efficient exploration of reward functions in inverse reinforcement learning via Bayesian optimization

S Balakrishnan, QP Nguyen… - Advances in Neural …, 2020 - proceedings.neurips.cc

The problem of inverse reinforcement learning (IRL) is relevant to a variety of tasks including
value alignment and robot learning from demonstration. Despite significant algorithmic …

Save Cite Cited by 32 Related articles All 7 versions Free GPT-4 DeepSeek View as HTML

Distributed inverse optimal control

W **, S Mou - Automatica, 2021 - Elsevier

This paper develops a distributed approach for inverse optimal control (IOC) in multi-agent
systems. Here each agent can only communicate with certain nearby neighbors and only …

Save Cite Cited by 23 Related articles All 2 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] ethz.ch

Inverse learning for data-driven calibration of model-based statistical path planning

M Menner, K Berntorp, MN Zeilinger… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

This paper presents a method for inverse learning of a control objective defined in terms of
requirements and their joint probability distribution from data. The probability distribution …

Save Cite Cited by 22 Related articles All 6 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Learning models of sequential decision-making with partial specification of agent behavior

VV Unhelkar, JA Shah - Proceedings of the AAAI conference on artificial …, 2019 - aaai.org

Artificial agents that interact with other (human or artificial) agents require models in order to
reason about those other agents' behavior. In addition to the predictive utility of these …

Save Cite Cited by 29 Related articles All 10 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs

D Straub, M Schultheis, H Koeppl… - Advances in Neural …, 2024 - proceedings.neurips.cc

Inverse optimal control can be used to characterize behavior in sequential decision-making
tasks. Most existing work, however, is limited to fully observable or linear systems, or …

Save Cite Cited by 2 Related articles All 6 versions Free GPT-4 DeepSeek View as HTML

Shared control with human trust and workload models

M Cubuktepe, N Jansen… - Cyber–Physical–Human …, 2023 - Wiley Online Library

We synthesize shared control protocols subject to probabilistic temporal logic specifications.
Specifically, we develop a framework in which a human and an autonomy protocol can issue …

Save Cite Cited by 3 Related articles All 4 versions Free GPT-4 DeepSeek

Create alert

Cite

Advanced search

Saved to My library

Expectation-maximization for inverse reinforcement learning with hidden data

An algorithmic perspective on imitation learning

A survey of inverse reinforcement learning: Challenges, methods and progress

A survey of inverse reinforcement learning

Weak human preference supervision for deep reinforcement learning

Efficient exploration of reward functions in inverse reinforcement learning via Bayesian optimization

Distributed inverse optimal control

Inverse learning for data-driven calibration of model-based statistical path planning

Learning models of sequential decision-making with partial specification of agent behavior

Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs

Shared control with human trust and workload models