- Academic Search

Z Zhu, H Zhao - IEEE Transactions on Intelligent Transportation …, 2021 - ieeexplore.ieee.org

Autonomous driving (AD) agents generate driving policies based on online perception
results, which are obtained at multiple levels of abstraction, e.g., behavior planning, motion …

Cited by 188

[PDF] mlr.press

Learning multimodal rewards from rankings

V Myers, E Biyik, N Anari… - Conference on robot …, 2022 - proceedings.mlr.press

Learning from human feedback has shown to be a useful approach in acquiring robot
reward functions. However, expert feedback is often assumed to be drawn from an …

Cited by 55

[PDF] arxiv.org

Deep generative models for offline policy learning: Tutorial, survey, and perspectives on future directions

J Chen, B Ganguly, Y Xu, Y Mei, T Lan… - arXiv preprint arXiv …, 2024 - arxiv.org

Deep generative models (DGMs) have demonstrated great success across various domains,
particularly in generating texts, images, and videos using models trained from offline data …

Cited by 10

[PDF] neurips.cc

Ess-infogail: Semi-supervised imitation learning from imbalanced demonstrations

H Fu, K Tang, Y Lu, Y Qi, G Deng… - Advances in Neural …, 2024 - proceedings.neurips.cc

Imitation learning aims to reproduce expert behaviors without relying on an explicit reward
signal. However, real-world demonstrations often present challenges, such as multi-modal …

Cited by 3

[PDF] mlr.press

Adversarial option-aware hierarchical imitation learning

M Jing, W Huang, F Sun, X Ma… - International …, 2021 - proceedings.mlr.press

It has been a challenge to learning skills for an agent from long-horizon unannotated
demonstrations. Existing approaches like Hierarchical Imitation Learning (HIL) are prone to …

Cited by 27

[PDF] bris.ac.uk

Lane change decision prediction: an efficient BO-XGB modelling approach with SHAP analysis

H Sun, Q Cheng, P Wang, Y Huang… - … A: Transport Science, 2024 - Taylor & Francis

The lane-change decision (LCD) is a critical aspect of driving behaviour. This study
proposes an LCD model based on a Bayesian optimization (BO) framework and extreme …

Cited by 2

A dynamic test scenario generation method for autonomous vehicles based on conditional generative adversarial imitation learning

L Jia, D Yang, Y Ren, C Qian, Q Feng, B Sun… - Accident Analysis & …, 2024 - Elsevier

Autonomous vehicles must be comprehensively evaluated before deployed in cities and
highways. However, most existing evaluation approaches for autonomous vehicles are static …

Cited by 8

[PDF] mdpi.com

Data-Driven Policy Learning Methods from Biological Behavior: A Systematic Review

Y Wang, M Hayashibe, D Owaki - Applied Sciences, 2024 - mdpi.com

Policy learning enables agents to learn how to map states to actions, thus enabling adaptive
and flexible behavioral generation in complex environments. Policy learning methods are …

RTA-IR: A runtime assurance framework for behavior planning based on imitation learning and responsibility-sensitive safety model

Y Peng, G Tan, H Si - Expert Systems with Applications, 2023 - Elsevier

Current research on artificial intelligence (AI) algorithms in safety–critical areas remains
extremely challenging due to their inability to be fully verified at design time. In this paper …

Cited by 8

[PDF] arxiv.org

Hierarchical Imitation Learning for Stochastic Environments

M Igl, P Shah, P Mougin, S Srinivasan… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org

Many applications of imitation learning require the agent to generate the full distribution of
behaviour observed in the training data. For example, to evaluate the safety of autonomous …

Cited by 1

Triple-GAIL: a multi-modal imitation learning framework with generative adversarial nets

A survey of deep RL and IL for autonomous driving policy learning
