Exploration in Deep Reinforcement Learning: From Single-Agent to Multi-Agent Domain J Hao, T Yang, H Tang, C Bai, J Liu, Z Meng, P Liu, Z Wang IEEE Transactions on Neural Networks and Learning Systems, 2023 | 242* | 2023 |
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning C Bai, L Wang, Z Yang, Z Deng, A Garg, P Liu, Z Wang International Conference on Learning representations (ICLR), 2022 | 161 | 2022 |
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing R Yang, C Bai, X Ma, Z Wang, C Zhang, L Han Neural Information Processing Systems (NeurIPS), 2022 | 84 | 2022 |
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning H He, C Bai, K Xu, Z Yang, W Zhang, D Wang, B Zhao, X Li Neural Information Processing Systems (NeurIPS), 2023 | 73 | 2023 |
Principled Exploration via Optimistic Bootstrapping and Backward Induction C Bai, L Wang, L Han, J Hao, A Garg, P Liu, Z Wang International Conference on Machine Learning (ICML), 2021 | 46 | 2021 |
Survey on Sparse Reward in Deep Reinforcement Learning W Yang, C Bai, C Cai, Y Zhao, P Liu 计算机科学 47 (3), 182-191, 2020 | 46* | 2020 |
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning S Qiu, L Wang, C Bai, Z Yang, Z Wang International Conference on Machine Learning (ICML), 18168-18210, 2022 | 39 | 2022 |
Dynamic Bottleneck for Robust Self-Supervised Exploration C Bai, L Wang, L Han, A Garg, J Hao, P Liu, Z Wang Neural Information Processing Systems (NeurIPS), 2021 | 33 | 2021 |
Guided Goal Generation for Hindsight Multi-Goal Reinforcement Learning C Bai, P Liu, W Zhao, X Tang Neurocomputing 359, 353-367, 2019 | 25 | 2019 |
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning C Bai, P Liu, K Liu, L Wang, Y Zhao, L Han, Z Wang IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021 | 19 | 2021 |
False Correlation Reduction for Offline Reinforcement Learning Z Deng, Z Fu, L Wang, Z Yang, C Bai, T Zhou, Z Wang, J Jiang IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 17* | 2023 |
Addressing Hindsight Bias in Multi-Goal Reinforcement Learning C Bai, L Wang, Y Wang, Z Wang, R Zhao, C Bai, P Liu IEEE Transactions on Cybernetics, 2021 | 17 | 2021 |
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration Y Zhang, S Yang, C Bai, F Wu, X Li, Z Wang, X Li arXiv preprint arXiv:2405.14314, 2024 | 15 | 2024 |
Cross-Domain Policy Adaptation via Value-Guided Data Filtering K Xu, C Bai, X Ma, D Wang, B Zhao, Z Wang, X Li, W Li Neural Information Processing Systems (NeurIPS), 2023 | 13 | 2023 |
Behavior Contrastive Learning for Unsupervised Skill Discovery R Yang, C Bai, H Guo, S Li, B Zhao, Z Wang, P Liu, X Li International Conference on Machine Learning (ICML), 2023 | 13 | 2023 |
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning C Bai, T Xiao, Z Zhu, L Wang, F Zhou, A Garg, B He, P Liu, Z Wang IEEE Transactions on Neural Networks and Learning Systems, 2022 | 13 | 2022 |
Active Sampling for Deep Q-learning Based on TD-error Adaptive Correction C Bai, P Liu, W Zhao, X Tang 计算机研究与发展 56 (2), 262-280, 2019 | 12* | 2019 |
Generating Attentive Goals for Prioritized Hindsight Reinforcement Learning P Liu, C Bai, Y Zhao, C Bai, W Zhao, X Tang Knowledge-Based Systems 203, 106140, 2020 | 11 | 2020 |
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training H He, C Bai, L Pan, W Zhang, B Zhao, X Li Neural Information Processing Systems (NeurIPS), 2024 | 10* | 2024 |
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning J Shi, C Bai, H He, L Han, D Wang, B Zhao, X Li, X Li IEEE International Conference on Robotics and Automation (ICRA), 2024 | 10 | 2024 |