Follow
Pushi Zhang
Pushi Zhang
Other names张蒲石
Microsoft Research
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Distributional reinforcement learning for multi-dimensional reward functions
P Zhang, X Chen, L Zhao, W Xiong, T Qin, TY Liu
Advances in Neural Information Processing Systems 34, 1519-1529, 2021
242021
Distributional pareto-optimal multi-objective reinforcement learning
XQ Cai, P Zhang, L Zhao, J Bian, M Sugiyama, A Llorens
Advances in Neural Information Processing Systems 36, 15593-15613, 2023
142023
An adaptive deep rl method for non-stationary environments with piecewise stable context
X Chen, X Zhu, Y Zheng, P Zhang, L Zhao, W Cheng, P Cheng, Y Xiong, ...
Advances in Neural Information Processing Systems 35, 35449-35461, 2022
112022
Demonstration actor critic
G Liu, L Zhao, P Zhang, J Bian, T Qin, N Yu, TY Liu
Neurocomputing 434, 194-202, 2021
102021
Asking Before Acting: Gather Information in Embodied Decision Making with Language Models
X Chen, S Zhang, P Zhang, L Zhao, J Chen
arXiv preprint arXiv:2305.15695, 2023
82023
Igor: Image-goal representations are the atomic control units for foundation models in embodied ai
X Chen, J Guo, T He, C Zhang, P Zhang, DC Yang, L Zhao, J Bian
arXiv preprint arXiv:2411.00785, 2024
42024
Independence-aware Advantage Estimation
LZ Pushi Zhang, G Liu, J Bian, M Huang, T Qin, TY Liu
Proceedings of the Thirtieth International Joint Conference on Artificial …, 2021
4*2021
Preference-conditioned Pixel-based AI Agent For Game Testing
S Abdelfattah, A Brown, P Zhang
2023 IEEE Conference on Games (CoG), 1-8, 2023
12023
IG-Net: Image-Goal Network for Offline Visual Navigation on A Large-Scale Game Map
P Zhang, B Zhu, XQ Cai, L Zhao, M Sugiyama, J Bian
2023
The system can't perform the operation now. Try again later.
Articles 1–9