Survey on large language model-enhanced reinforcement learning: Concept, taxonomy, and methods

Y Cao, H Zhao, Y Cheng, T Shu, Y Chen… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
With extensive pretrained knowledge and high-level general capabilities, large language
models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in …

Comprehensive overview of reward engineering and sha** in advancing reinforcement learning applications

S Ibrahim, M Mostafa, A Jnadi, H Salloum… - IEEE …, 2024 - ieeexplore.ieee.org
Reinforcement Learning (RL) seeks to develop systems capable of autonomous decision-
making by learning through interaction with their environment. Central to this process are …

A survey on self-evolving autonomous driving: a perspective on data closed-loop technology

X Li, Z Wang, Y Huang, H Chen - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Self evolution refers to the ability of a system to evolve autonomously towards a better
performance, which is a potential trend for autonomous driving systems based on self …

ExSelfRL: An exploration-inspired self-supervised reinforcement learning approach to molecular generation

J Wang, F Zhu - Expert Systems with Applications, 2025 - Elsevier
Efficiently searching for novel molecules with specific properties is critical to molecular
generation. Some existing works focus on combining deep generative models and …

Wtoe: Learning when to explore in multiagent reinforcement learning

S Dong, H Mao, S Yang, S Zhu, W Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Existing multiagent exploration works focus on how to explore in the fully cooperative task,
which is insufficient in the environment with nonstationarity induced by agent interactions. To …

Decentralized counterfactual value with threat detection for multi-agent reinforcement learning in mixed cooperative and competitive environments

S Dong, C Li, S Yang, W Li, Y Gao - Expert Systems with Applications, 2024 - Elsevier
This paper proposes a fully decentralized approach to address the challenge of general
mixed cooperation and competition within the domain of Multi-Agent Reinforcement …

Exploratory optimal stop**: A singular control formulation

J Dianetti, G Ferrari, R Xu - ar** problems from a
reinforcement learning perspective. We begin by formulating the stop** problem using …

[PDF][PDF] Population-based diverse exploration for sparse-reward multi-agent tasks

P Xu, J Zhang, K Huang - Proceedings of the Thirty-Third International Joint …, 2024 - ijcai.org
Exploration under sparse rewards is a key challenge for multi-agent reinforcement learning
problems. Although population-based learning shows its potential in producing diverse …

Reward Sha** for Happier Autonomous Cyber Security Agents

E Bates, V Mavroudis, C Hicks - Proceedings of the 16th ACM Workshop …, 2023 - dl.acm.org
As machine learning models become more capable, they have exhibited increased potential
in solving complex tasks. One of the most promising directions uses deep reinforcement …

Drlc: Reinforcement learning with dense rewards from llm critic

M Cao, L Shu, L Yu, Y Zhu, N Wichers, Y Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Reinforcement learning (RL) can align language models with non-differentiable reward
signals, such as human preferences. However, a major challenge arises from the sparsity of …