Believe what you see: Implicit constraint approach for offline multi-agent reinforcement learning Y Yang, X Ma, C Li, Z Zheng, Q Zhang, G Huang, J Yang, Q Zhao (NeurIPS Spotlight) Advances in Neural Information Processing Systems 34 …, 2021 | 90 | 2021 |
Offline Reinforcement Learning with Value-based Episodic Memory X Ma, Y Yang, H Hu, Q Liu, J Yang, C Zhang, Q Zhao, B Liang (ICLR) International Conference on Learning Representations, 2021 | 47 | 2021 |
The difference learning of hidden layer between autoencoder and variational autoencoder Q Xu, Z Wu, Y Yang, L Zhang 2017 29th Chinese Control And Decision Conference (CCDC), 4801-4804, 2017 | 40 | 2017 |
Deep convolutional neural network-based autonomous marine vehicle maneuver Q Xu, Y Yang, C Zhang, L Zhang International Journal of Fuzzy Systems 20, 687-699, 2018 | 37 | 2018 |
Two-wheeled robot platform based on PID control J Meng, A Liu, Y Yang, Z Wu, Q Xu 2018 5th International Conference on Information Science and Control …, 2018 | 36 | 2018 |
A deep fully convolution neural network for semantic segmentation based on adaptive feature fusion A Liu, Y Yang, Q Sun, Q Xu 2018 5th International Conference on Information Science and Control …, 2018 | 31 | 2018 |
An automated grader for Chinese essay combining shallow and deep semantic attributes Y Yang, L Xia, Q Zhao IEEE Access 7, 176306-176316, 2019 | 26 | 2019 |
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning X Ma, Y Yang, C Li, Y Lu, Q Zhao, Y Jun (AAMAS) International Conference on Autonomous Agents and MultiAgent Systems, 2021 | 23 | 2021 |
On the Role of Discount Factor in Offline Reinforcement Learning H Hu, Y Yang, Q Zhao, C Zhang (ICML) International Conference on Machine Learning, 2022 | 19 | 2022 |
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery Y Yang, H Hu, W Li, S Li, J Yang, Q Zhao, C Zhang (AAAI Oral) Association for the Advancement of Artificial Intelligence, 2022 | 14 | 2022 |
The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning H Hu, Y Yang, Q Zhao, C Zhang (ICLR) International Conference on Learning Representations, 2023 | 13 | 2023 |
Deep learning technique-based steering of autonomous car Y Yang, Z Wu, Q Xu, F Yan International Journal of Computational Intelligence and Applications 17 (02 …, 2018 | 10 | 2018 |
Uac: Offline reinforcement learning with uncertain action constraint J Guan, S Gu, Z Li, J Hou, Y Yang, G Chen, C Jiang IEEE Transactions on Cognitive and Developmental Systems 16 (2), 671-680, 2023 | 8 | 2023 |
Different latent variables learning in variational autoencoder Q Xu, Y Yang, Z Wu, L Zhang 2017 4th International Conference on Information, Cybernetics and …, 2017 | 7 | 2017 |
No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning D Zhong, Y Yang, Q Zhao (AAAI) Association for the Advancement of Artificial Intelligence, 2023 | 6 | 2023 |
Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning Q Zhang, X Ma, Y Yang, C Li, J Yang, Y Liu, B Liang IEEE Robotics and Automation Letters, 2021 | 6 | 2021 |
Unsupervised Behavior Extraction via Random Intent Priors H Hu, Y Yang, J Ye, Z Mai, C Zhang (NeurIPS) Advances in Neural Information Processing Systems., 2023 | 5 | 2023 |
Learning Diverse Risk Preferences in Population-based Self-play Y Jiang, Q Liu, X Ma, C Li, Y Yang, J Yang, B Liang, Q Zhao (AAAI) Association for the Advancement of Artificial Intelligence, 2023 | 4 | 2023 |
Bayesian Design Principles for Offline-to-Online Reinforcement Learning H Hu, Y Yang, J Ye, C Wu, Z Mai, Y Hu, T Lv, C Fan, Q Zhao, C Zhang (ICML) International Conference on Machine Learning, 2024 | 2 | 2024 |
Episodic Novelty Through Temporal Distance Y Jiang, Q Liu, Y Yang, X Ma, D Zhong, H Hu, J Yang, B Liang, B Xu, ... (ICLR) International Conference on Learning Representations, 2025 | | 2025 |