Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Multi-agent reinforcement learning: Methods, applications, visionary prospects, and challenges

Z Zhou, G Liu, Y Tang - arxiv preprint arxiv:2305.10091, 2023 - arxiv.org
Multi-agent reinforcement learning (MARL) is a widely used Artificial Intelligence (AI)
technique. However, current studies and applications need to address its scalability, non …

Constrained decision transformer for offline safe reinforcement learning

Z Liu, Z Guo, Y Yao, Z Cen, W Yu… - International …, 2023 - proceedings.mlr.press
Safe reinforcement learning (RL) trains a constraint satisfaction policy by interacting with the
environment. We aim to tackle a more challenging problem: learning a safe policy from an …

Safety gymnasium: A unified safe reinforcement learning benchmark

J Ji, B Zhang, J Zhou, X Pan… - Advances in …, 2023 - proceedings.neurips.cc
Artificial intelligence (AI) systems possess significant potential to drive societal progress.
However, their deployment often faces obstacles due to substantial safety concerns. Safe …

Curriculum reinforcement learning using optimal transport via gradual domain adaptation

P Huang, M Xu, J Zhu, L Shi… - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract Curriculum Reinforcement Learning (CRL) aims to create a sequence of tasks,
starting from easy ones and gradually learning towards difficult tasks. In this work, we focus …

Datasets and benchmarks for offline safe reinforcement learning

Z Liu, Z Guo, H Lin, Y Yao, J Zhu, Z Cen, H Hu… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper presents a comprehensive benchmarking suite tailored to offline safe
reinforcement learning (RL) challenges, aiming to foster progress in the development and …

Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers

L Yuan, Z Zhang, K Xue, H Yin, F Chen… - Proceedings of the …, 2023 - ojs.aaai.org
Abstract Cooperative Multi-agent Reinforcement Learning (CMARL) has shown to be
promising for many real-world applications. Previous works mainly focus on improving …

Beyond black-box advice: learning-augmented algorithms for MDPs with Q-value predictions

T Li, Y Lin, S Ren, A Wierman - Advances in Neural …, 2024 - proceedings.neurips.cc
We study the tradeoff between consistency and robustness in the context of a single-
trajectory time-varying Markov Decision Process (MDP) with untrusted machine-learned …

Learning shared safety constraints from multi-task demonstrations

K Kim, G Swamy, Z Liu, D Zhao… - Advances in Neural …, 2024 - proceedings.neurips.cc
Regardless of the particular task we want to perform in an environment, there are often
shared safety constraints we want our agents to respect. For example, regardless of whether …

Safety-aware causal representation for trustworthy offline reinforcement learning in autonomous driving

H Lin, W Ding, Z Liu, Y Niu, J Zhu… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
In the domain of autonomous driving, the offline Reinforcement Learning (RL) approaches
exhibit notable efficacy in addressing sequential decision-making problems from offline …