Quality-similar diversity via population based reinforcement learning
Diversity is a growing research topic in Reinforcement Learning (RL). Previous research on
diversity has mainly focused on promoting diversity to encourage exploration and thereby …
diversity has mainly focused on promoting diversity to encourage exploration and thereby …
Reinforcement learning by guided safe exploration
Safety is critical to broadening the application of reinforcement learning (RL). Often, we train
RL agents in a controlled environment, such as a laboratory, before deploying them in the …
RL agents in a controlled environment, such as a laboratory, before deploying them in the …
[PDF][PDF] Training and transferring safe policies in reinforcement learning
Safety is critical to broadening the application of reinforcement learning (RL). Often, RL
agents are trained in a controlled environment, such as a laboratory, before being deployed …
agents are trained in a controlled environment, such as a laboratory, before being deployed …
Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI
" Human-aware" has become a popular keyword used to describe a particular class of AI
systems that are designed to work and interact with humans. While there exists a surprising …
systems that are designed to work and interact with humans. While there exists a surprising …
Inverse Reinforcement Learning with Learning and Leveraging Demonstrators' Varying Expertise Levels
A common assumption in most Inverse Reinforcement Learning (IRL) methods is that human
demonstrations are drawn from an optimal policy. However, this assumption poses a …
demonstrations are drawn from an optimal policy. However, this assumption poses a …
[HTML][HTML] Safe Online and Offline Reinforcement Learning
TD Simão - 2023 - research.tudelft.nl
Reinforcement Learning (RL) agents can solve general problems based on little to no
knowledge of the underlying environment. These agents learn through experience, using a …
knowledge of the underlying environment. These agents learn through experience, using a …
A population diversity-based robust policy generation method in adversarial game environments# br
S ZHUANG, Y CHEN, Y HAO, W WU… - Computer …, 2024 - joces.nudt.edu.cn
In adversarial game environments, the objective agent aims to generate robust game
policies, ensuring high returns when facing different opponent policies consistently. Existing …
policies, ensuring high returns when facing different opponent policies consistently. Existing …
Control, Learning and Adaptation in Information-Constrained, Adversarial Environments
We propose to develop a theoretical and algorithmic foundation that will help create
autonomous robotic agents capable of executing patrol missions in urban environments …
autonomous robotic agents capable of executing patrol missions in urban environments …
对抗环境中基于种群多样性的鲁棒策略生成方法
庄述鑫, 陈永红, 郝一行, 吴巍炜, 徐学永… - 计算机工程与 …, 2024 - joces.nudt.edu.cn
在对抗博弈环境中, 目标智能体希望生成具有高鲁棒性的博弈策略, 使得目标智能体在面对不同
对手策略时, 始终具有较高的收益. 现有的基于自我博弈的策略生成方法通常会过拟合到针对 …
对手策略时, 始终具有较高的收益. 现有的基于自我博弈的策略生成方法通常会过拟合到针对 …