Towards efficient llm grounding for embodied multi-agent collaboration Y Zhang, S Yang, C Bai, F Wu, X Li, Z Wang, X Li arXiv preprint arXiv:2405.14314, 2024 | 15 | 2024 |
Contrastive representation for data filtering in cross-domain offline reinforcement learning X Wen, C Bai, K Xu, X Yu, Y Zhang, X Li, Z Wang arXiv preprint arXiv:2405.06192, 2024 | 4 | 2024 |
Multi-agent Exploration with Sub-state Entropy Estimation J Tao, Y Chen, Y Zhang, K Yang, X Li 2024 International Joint Conference on Neural Networks (IJCNN), 1-9, 2024 | 2 | 2024 |
Online Preference Alignment for Language Models via Count-based Exploration C Bai, Y Zhang, S Qiu, Q Zhang, K Xu, X Li arXiv preprint arXiv:2501.12735, 2025 | 1 | 2025 |
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models Y Zhang, C Bai, B Zhao, J Yan, X Li, X Li arXiv preprint arXiv:2406.15836, 2024 | 1 | 2024 |
GTLMA: Generalizable Hierarchical Learning for Tasks with Variable Entities K Yang, A Gong, J Tao, Y Zhang, X Li 2023 International Conference on Frontiers of Robotics and Software …, 2023 | 1 | 2023 |
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner C Fan, C Bai, Z Shan, H He, Y Zhang, Z Wang arXiv preprint arXiv:2409.19949, 2024 | | 2024 |