QPLEX: Duplex Dueling Multi-Agent Q-Learning J Wang, Z Ren, T Liu, Y Yu, C Zhang Ninth International Conference on Learning Representations (ICLR 2021), 2021 | 546 | 2021 |
Influence-Based Multi-Agent Exploration T Wang, J Wang, Y Wu, C Zhang Eighth International Conference on Learning Representations (ICLR 2020 Spotlight), 2020 | 176 | 2020 |
Learning Nearly Decomposable Value Functions via Communication Minimization T Wang, J Wang, C Zheng, C Zhang Eighth International Conference on Learning Representations (ICLR 2020), 2020 | 162 | 2020 |
Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration L Zheng, J Chen, J Wang, J He, Y Hu, Y Chen, C Fan, Y Gao, C Zhang Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), 2021 | 93 | 2021 |
Multi-Agent Incentive Communication via Decentralized Teammate Modeling L Yuan, J Wang, F Zhang, C Wang, Z Zhang, Y Yu, C Zhang Thirty-sixth AAAI Conference on Artificial Intelligence (AAAI 2022), 2022 | 67 | 2022 |
Offline Reinforcement Learning with Reverse Model-based Imagination J Wang, W Li, H Jiang, G Zhu, S Li, C Zhang Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), 2021 | 64 | 2021 |
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization J Wang, Z Ren, B Han, J Ye, C Zhang Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), 2021 | 62* | 2021 |
Learning Subgoal Representations with Slow Dynamics S Li, L Zheng, J Wang, C Zhang Ninth International Conference on Learning Representations (ICLR 2021), 2021 | 56 | 2021 |
MetaCURE: Meta Reinforcement Learning with Empowerment-driven Exploration J Zhang, J Wang, H Hu, T Chen, Y Chen, C Fan, C Zhang Thirty-eighth International Conference on Machine Learning (ICML 2021), 2021 | 50* | 2021 |
Active Hierarchical Exploration with Stable Subgoal Representation Learning S Li, J Zhang, J Wang, Y Yu, C Zhang Tenth International Conference on Learning Representations (ICLR 2022), 2022 | 36* | 2022 |
Latent-Variable Advantage-Weighted Policy Optimization for Offline RL X Chen, A Ghadirzadeh, T Yu, Y Gao, J Wang, W Li, B Liang, C Finn, ... Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022), 2022 | 29 | 2022 |
LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates J Cao, L Yuan, J Wang, S Zhang, C Zhang, Y Yu, DC Zhan SCIENCE CHINA Information Sciences, 2022, 2021 | 23 | 2021 |
Multi-Agent Concentrative Coordination with Decentralized Task Representation L Yuan, C Wang, J Wang, F Zhang, F Chen, C Guan, Z Zhang, C Zhang, ... 31st International Joint Conference on Artificial Intelligence (IJCAI 2022), 2022 | 18 | 2022 |
Self-Organized Polynomial-Time Coordination Graphs Q Yang, W Dong, Z Ren, J Wang, T Wang, C Zhang Thirty-ninth International Conference on Machine Learning (ICML 2022), 2022 | 14 | 2022 |
Object-Oriented Dynamics Learning through Multi-Level Abstraction G Zhu, J Wang, Z Ren, Z Lin, C Zhang Thirty-fourth AAAI Conference on Artificial Intelligence (AAAI 2020), 2020 | 12 | 2020 |
Offline Meta Reinforcement Learning with In-Distribution Online Adaptation J Wang, J Zhang, H Jiang, J Zhang, L Wang, C Zhang Fortieth International Conference on Machine Learning (ICML 2023), 2023 | 11 | 2023 |
Towards Global Optimality in Cooperative MARL with Sequential Transformation J Ye, C Li, J Wang, C Zhang arXiv preprint arXiv:2207.11143, 2022 | 7 | 2022 |
Offline Communication Learning with Multi-source Datasets Y Mao, R Hu, L Zheng, J Wang, C Zhang | | 2023 |