Baichuan 2: Open large-scale language models A Yang, B Xiao, B Wang, B Zhang, C Bian, C Yin, C Lv, D Pan, D Wang, ... arXiv preprint arXiv:2309.10305, 2023 | 587* | 2023 |
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset J Ji*, M Liu*, J Dai*, X Pan, C Zhang, C Bian, R Sun, Y Wang, Y Yang arXiv preprint arXiv:2307.04657, 2023 | 335 | 2023 |
Safe rlhf: Safe reinforcement learning from human feedback J Dai, X Pan, R Sun, J Ji, X Xu, M Liu, Y Wang, Y Yang arXiv preprint arXiv:2310.12773, 2023 | 258 | 2023 |
Omnisafe: An infrastructure for accelerating safe reinforcement learning research J Ji, J Zhou, B Zhang, J Dai, X Pan, R Sun, W Huang, Y Geng, M Liu, ... Journal of Machine Learning Research 25 (285), 1-6, 2024 | 49 | 2024 |
Mate: Benchmarking multi-agent reinforcement learning in distributed target coverage control X Pan, M Liu, F Zhong, Y Yang, SC Zhu, Y Wang Advances in Neural Information Processing Systems 35, 27862-27879, 2022 | 33 | 2022 |
Proactive Multi-Camera Collaboration For 3D Human Pose Estimation H Ci*, M Liu*, X Pan*, F Zhong, Y Wang The 11th International Conference on Learning Representations (ICLR), 2023 | 14 | 2023 |