Quality-similar diversity via population based reinforcement learning

S Wu, J Yao, H Fu, Y Tian, C Qian, Y Yang… - The Eleventh …, 2023 - openreview.net
Diversity is a growing research topic in Reinforcement Learning (RL). Previous research on
diversity has mainly focused on promoting diversity to encourage exploration and thereby …

Reinforcement learning by guided safe exploration

Q Yang, TD Simão, N Jansen, SH Tindemans… - ECAI 2023, 2023 - ebooks.iospress.nl
Safety is critical to broadening the application of reinforcement learning (RL). Often, we train
RL agents in a controlled environment, such as a laboratory, before deploying them in the …

[PDF][PDF] Training and transferring safe policies in reinforcement learning

Q Yang, T Simão, N Jansen, S Tindemans, M Spaan - 2022 - repository.ubn.ru.nl
Safety is critical to broadening the application of reinforcement learning (RL). Often, RL
agents are trained in a controlled environment, such as a laboratory, before being deployed …

Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI

S Tulli, SL Vasileiou, S Sreedharan - arxiv preprint arxiv:2405.07773, 2024 - arxiv.org
" Human-aware" has become a popular keyword used to describe a particular class of AI
systems that are designed to work and interact with humans. While there exists a surprising …

Inverse Reinforcement Learning with Learning and Leveraging Demonstrators' Varying Expertise Levels

S Oguchienti, M Ghasemi - 2023 59th Annual Allerton …, 2023 - ieeexplore.ieee.org
A common assumption in most Inverse Reinforcement Learning (IRL) methods is that human
demonstrations are drawn from an optimal policy. However, this assumption poses a …

[HTML][HTML] Safe Online and Offline Reinforcement Learning

TD Simão - 2023 - research.tudelft.nl
Reinforcement Learning (RL) agents can solve general problems based on little to no
knowledge of the underlying environment. These agents learn through experience, using a …

A population diversity-based robust policy generation method in adversarial game environments# br

S ZHUANG, Y CHEN, Y HAO, W WU… - Computer …, 2024 - joces.nudt.edu.cn
In adversarial game environments, the objective agent aims to generate robust game
policies, ensuring high returns when facing different opponent policies consistently. Existing …

Control, Learning and Adaptation in Information-Constrained, Adversarial Environments

YE Bayiz, S Carr, ES Crafts, M Cubuktepe, F Djeumou… - 2023 - apps.dtic.mil
We propose to develop a theoretical and algorithmic foundation that will help create
autonomous robotic agents capable of executing patrol missions in urban environments …

对抗环境中基于种群多样性的鲁棒策略生成方法

庄述鑫, 陈永红, 郝一行, 吴巍炜, 徐学永… - 计算机工程与 …, 2024 - joces.nudt.edu.cn
在对抗博弈环境中, 目标智能体希望生成具有高鲁棒性的博弈策略, 使得目标智能体在面对不同
对手策略时, 始终具有较高的收益. 现有的基于自我博弈的策略生成方法通常会过拟合到针对 …