Safety gymnasium: A unified safe reinforcement learning benchmark J Ji, B Zhang, J Zhou, X Pan, W Huang, R Sun, Y Geng, Y Zhong, J Dai, ... Advances in Neural Information Processing Systems 36, 2023 | 58 | 2023 |
Heterogeneous-agent reinforcement learning Y Zhong, JG Kuba, X Feng, S Hu, J Ji, Y Yang Journal of Machine Learning Research 25 (1-67), 1, 2024 | 49 | 2024 |
Marllib: A scalable and efficient multi-agent reinforcement learning library S Hu, Y Zhong, M Gao, W Wang, H Dong, X Liang, Z Li, X Chang, Y Yang Journal of Machine Learning Research 24 (315), 1-23, 2023 | 23 | 2023 |
Panacea: Pareto alignment via preference adaptation for llms Y Zhong, C Ma, X Zhang, Z Yang, H Chen, Q Zhang, S Qi, Y Yang arXiv preprint arXiv:2402.02030, 2024 | 21 | 2024 |
CivRealm: A learning and reasoning odyssey in Civilization for decision-making agents S Qi, S Chen, Y Li, X Kong, J Wang, B Yang, P Wong, Y Zhong, X Zhang, ... arXiv preprint arXiv:2401.10568, 2024 | 12 | 2024 |
Marllib: Extending rllib for multi-agent reinforcement learning S Hu, Y Zhong, M Gao, W Wang, H Dong, Z Li, X Liang, Y Yang, X Chang | 12 | 2022 |
Maximum Entropy Heterogeneous-Agent Reinforcement Learning J Liu, Y Zhong, S Hu, H Fu, Q Fu, X Chang, Y Yang The Twelfth International Conference on Learning Representations, 0 | 11 | |
MyoChallenge 2022: Learning contact-rich manipulation using a musculoskeletal hand V Caggiano, G Durandau, H Wang, A Chiappa, A Mathis, P Tano, N Patel, ... NeurIPS 2022 Competition Track, 233-250, 2023 | 8 | 2023 |
Marllib: A scalable multi-agent reinforcement learning library S Hu, Y Zhong, M Gao, W Wang, H Dong, Z Li, X Liang, X Chang, Y Yang arXiv preprint arXiv:2210.13708, 2022 | 4 | 2022 |
Maximum Entropy Heterogeneous-Agent Mirror Learning J Liu, Y Zhong, S Hu, H Fu, Q Fu, X Chang, Y Yang arXiv preprint arXiv:2306.10715, 2023 | 2 | 2023 |
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library S Hu, Y Zhong, M Gao, W Wang, H Dong, X Liang, Z Li, X Chang, Y Yang arXiv preprint arXiv:2210.13708, 2022 | 2 | 2022 |
Off-agent trust region policy optimization R Chen, X Zhang, Y Du, Y Zhong, Z Tian, F Sun, Y Yang International Joint Conference on Artificial Intelligence, 2024 | | 2024 |