Survey on large language model-enhanced reinforcement learning: Concept, taxonomy, and methods

Y Cao, H Zhao, Y Cheng, T Shu, Y Chen… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
With extensive pretrained knowledge and high-level general capabilities, large language
models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in …

SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation

B Stoler, I Navarro, J Francis, J Oh - arxiv preprint arxiv:2409.10320, 2024 - arxiv.org
Verification and validation of autonomous driving (AD) systems and components is of
increasing importance, as such technology increases in real-world prevalence. Safety …

Inductive learning of robot task knowledge from raw data and online expert feedback

D Meli, P Fiorini - Machine Learning, 2025 - Springer
The increasing level of autonomy of robots poses challenges of trust and social acceptance,
especially in human-robot interaction scenarios. This requires an interpretable …

NPE-DRL: Enhancing Perception Constrained Obstacle Avoidance with Non-Expert Policy Guided Reinforcement Learning

Y Zhang, C Yan, J **ao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Obstacle avoidance under constrained visual perception presents a significant challenge,
requiring rapid detection and decision-making within partially observable environments …

Reachability-Aware Reinforcement Learning for Collision Avoidance in Human-Machine Shared Control

S Zhao, J Zhang, N Masoud, J Li, Y Zheng… - arxiv preprint arxiv …, 2025 - arxiv.org
Human-machine shared control in critical collision scenarios aims to aid drivers' accident
avoidance through intervening only when necessary. Existing methods count on replanning …

[HTML][HTML] Clever Hans in the Loop? A Critical Examination of ChatGPT in a Human-in-the-Loop Framework for Machinery Functional Safety Risk Analysis

P Iyenghar - Eng, 2025 - mdpi.com
This paper presents a first-of-its-kind evaluation of integrating Large Language Models
(LLMs) within a Human-In-The-Loop (HITL) framework for risk analysis in machinery …

[HTML][HTML] Behavior Safety Decision-Making Based on Deep Deterministic Policy Gradient and Its Verification Method

Y Zhu, Z Li, J Wang, Y Zhao, M Li - Symmetry, 2025 - mdpi.com
As an emerging mode of transportation, autonomous vehicles are increasingly attracting
widespread attention. To address the issues of the traditional reinforcement learning …