Follow
Yifan Zhong
Yifan Zhong
Verified email at stu.pku.edu.cn - Homepage
Title
Cited by
Cited by
Year
Safety gymnasium: A unified safe reinforcement learning benchmark
J Ji, B Zhang, J Zhou, X Pan, W Huang, R Sun, Y Geng, Y Zhong, J Dai, ...
Advances in Neural Information Processing Systems 36, 2023
582023
Heterogeneous-agent reinforcement learning
Y Zhong, JG Kuba, X Feng, S Hu, J Ji, Y Yang
Journal of Machine Learning Research 25 (1-67), 1, 2024
492024
Marllib: A scalable and efficient multi-agent reinforcement learning library
S Hu, Y Zhong, M Gao, W Wang, H Dong, X Liang, Z Li, X Chang, Y Yang
Journal of Machine Learning Research 24 (315), 1-23, 2023
232023
Panacea: Pareto alignment via preference adaptation for llms
Y Zhong, C Ma, X Zhang, Z Yang, H Chen, Q Zhang, S Qi, Y Yang
arXiv preprint arXiv:2402.02030, 2024
212024
CivRealm: A learning and reasoning odyssey in Civilization for decision-making agents
S Qi, S Chen, Y Li, X Kong, J Wang, B Yang, P Wong, Y Zhong, X Zhang, ...
arXiv preprint arXiv:2401.10568, 2024
122024
Marllib: Extending rllib for multi-agent reinforcement learning
S Hu, Y Zhong, M Gao, W Wang, H Dong, Z Li, X Liang, Y Yang, X Chang
122022
Maximum Entropy Heterogeneous-Agent Reinforcement Learning
J Liu, Y Zhong, S Hu, H Fu, Q Fu, X Chang, Y Yang
The Twelfth International Conference on Learning Representations, 0
11
MyoChallenge 2022: Learning contact-rich manipulation using a musculoskeletal hand
V Caggiano, G Durandau, H Wang, A Chiappa, A Mathis, P Tano, N Patel, ...
NeurIPS 2022 Competition Track, 233-250, 2023
82023
Marllib: A scalable multi-agent reinforcement learning library
S Hu, Y Zhong, M Gao, W Wang, H Dong, Z Li, X Liang, X Chang, Y Yang
arXiv preprint arXiv:2210.13708, 2022
42022
Maximum Entropy Heterogeneous-Agent Mirror Learning
J Liu, Y Zhong, S Hu, H Fu, Q Fu, X Chang, Y Yang
arXiv preprint arXiv:2306.10715, 2023
22023
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library
S Hu, Y Zhong, M Gao, W Wang, H Dong, X Liang, Z Li, X Chang, Y Yang
arXiv preprint arXiv:2210.13708, 2022
22022
Off-agent trust region policy optimization
R Chen, X Zhang, Y Du, Y Zhong, Z Tian, F Sun, Y Yang
International Joint Conference on Artificial Intelligence, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–12