Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Survey on large language model-enhanced reinforcement learning: Concept, taxonomy, and methods
With extensive pretrained knowledge and high-level general capabilities, large language
models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in …
models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in …
Comprehensive overview of reward engineering and sha** in advancing reinforcement learning applications
Reinforcement Learning (RL) seeks to develop systems capable of autonomous decision-
making by learning through interaction with their environment. Central to this process are …
making by learning through interaction with their environment. Central to this process are …
A survey on self-evolving autonomous driving: a perspective on data closed-loop technology
Self evolution refers to the ability of a system to evolve autonomously towards a better
performance, which is a potential trend for autonomous driving systems based on self …
performance, which is a potential trend for autonomous driving systems based on self …
ExSelfRL: An exploration-inspired self-supervised reinforcement learning approach to molecular generation
J Wang, F Zhu - Expert Systems with Applications, 2025 - Elsevier
Efficiently searching for novel molecules with specific properties is critical to molecular
generation. Some existing works focus on combining deep generative models and …
generation. Some existing works focus on combining deep generative models and …
Wtoe: Learning when to explore in multiagent reinforcement learning
Existing multiagent exploration works focus on how to explore in the fully cooperative task,
which is insufficient in the environment with nonstationarity induced by agent interactions. To …
which is insufficient in the environment with nonstationarity induced by agent interactions. To …
Decentralized counterfactual value with threat detection for multi-agent reinforcement learning in mixed cooperative and competitive environments
This paper proposes a fully decentralized approach to address the challenge of general
mixed cooperation and competition within the domain of Multi-Agent Reinforcement …
mixed cooperation and competition within the domain of Multi-Agent Reinforcement …
Exploratory optimal stop**: A singular control formulation
J Dianetti, G Ferrari, R Xu - ar** problems from a
reinforcement learning perspective. We begin by formulating the stop** problem using …
reinforcement learning perspective. We begin by formulating the stop** problem using …
[PDF][PDF] Population-based diverse exploration for sparse-reward multi-agent tasks
P Xu, J Zhang, K Huang - Proceedings of the Thirty-Third International Joint …, 2024 - ijcai.org
Exploration under sparse rewards is a key challenge for multi-agent reinforcement learning
problems. Although population-based learning shows its potential in producing diverse …
problems. Although population-based learning shows its potential in producing diverse …
Reward Sha** for Happier Autonomous Cyber Security Agents
As machine learning models become more capable, they have exhibited increased potential
in solving complex tasks. One of the most promising directions uses deep reinforcement …
in solving complex tasks. One of the most promising directions uses deep reinforcement …
Drlc: Reinforcement learning with dense rewards from llm critic
Reinforcement learning (RL) can align language models with non-differentiable reward
signals, such as human preferences. However, a major challenge arises from the sparsity of …
signals, such as human preferences. However, a major challenge arises from the sparsity of …