Spremljaj
Xidong Feng
Xidong Feng
Google DeepMind
Preverjeni e-poštni naslov na google.com - Domača stran
Naslov
Navedeno
Navedeno
Leto
Alphazero-like tree-search can guide large language model decoding and training
X Feng, Z Wan, M Wen, SM McAleer, Y Wen, W Zhang, J Wang
Forty-first International Conference on Machine Learning, 2024
1152024
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ...
NeurIPS 2022, 2022
1052022
Heterogeneous-agent mirror learning: A continuum of solutions to cooperative marl
JG Kuba*, X Feng*, S Ding, H Dong, J Wang, Y Yang
JMLR, 2022
74*2022
Vehicle trajectory prediction using intention-based conditional variational autoencoder
X Feng, Z Cen, J Hu, Y Zhang
2019 IEEE Intelligent Transportation Systems Conference (ITSC), 3514-3519, 2019
672019
Towards effective context for meta-reinforcement learning: an approach based on contrastive learning
H Fu, H Tang, J Hao, C Chen, X Feng, D Li, W Liu
Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7457-7465, 2021
532021
Neural Auto-Curricula
X Feng*, O Slumbers*, Y Yang, Z Wan, B Liu, S McAleer, Y Wen, J Wang
NeurIPS 2021, 2021
51*2021
ChessGPT: Bridging Policy Learning and Language Modeling
X Feng, Y Luo, Z Wang, H Tang, M Yang, K Shao, D Mguni, Y Du, J Wang
Advances in Neural Information Processing Systems 36, 2024
492024
Mri reconstruction with interpretable pixel-wise operations using reinforcement learning
W Li*, X Feng*, H An, XY Ng, YJ Zhang
Proceedings of the AAAI conference on artificial intelligence 34 (01), 792-799, 2020
372020
CMML: Contextual modulation meta learning for cold-start recommendation
X Feng, C Chen, D Li, M Zhao, J Hao, J Wang
Proceedings of the 30th ACM international conference on information …, 2021
332021
Pangu-agent: A fine-tunable generalist agent with structured reasoning
F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, ...
arXiv preprint arXiv:2312.14878, 2023
182023
Torchopt: An efficient library for differentiable optimization
J Ren*, X Feng*, B Liu*, X Pan*, Y Fu, L Mai, Y Yang
JMLR Open Source Software, 2022
162022
Uncertainty of thoughts: Uncertainty-aware planning enhances information seeking in LLMs
Z Hu, C Liu, X Feng, Y Zhao, SK Ng, AT Luu, J He, PWW Koh, B Hooi
Advances in Neural Information Processing Systems 37, 24181-24215, 2025
15*2025
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
X Feng*, B Liu*, J Ren, L Mai, R Zhu, J Wang, Y Yang
NeurIPS 2022, 2021
14*2021
Autonomous lane change decision making using different deep reinforcement learning methods
X Feng, J Hu, Y Huo, Y Zhang
CICTP 2019, 5563-5575, 2019
112019
Contextual Transformer for Offline Meta Reinforcement Learning
R Lin, Y Li, X Feng, Z Zhang, XHW Fung, H Zhang, J Wang, Y Du, Y Yang
NeurIPS2022 FMDM workshop, 2022
102022
MANSA: Learning fast and slow in multi-agent systems
DH Mguni, H Chen, T Jafferjee, J Wang, L Yue, X Feng, SM Mcaleer, ...
International Conference on Machine Learning, 24631-24658, 2023
62023
Natural language reinforcement learning
X Feng, Z Wan, H Fu, B Liu, M Yang, GA Koushik, Z Hu, Y Wen, J Wang
arXiv preprint arXiv:2411.14251, 2024
32024
Efficient Reinforcement Learning with Large Language Model Priors
X Yan, Y Song, X Feng, M Yang, H Zhang, HB Ammar, J Wang
arXiv preprint arXiv:2410.07927, 2024
22024
World Models: Understanding, Modelling and Scaling
M Yang, H Li, F Laakom, X Feng, J Shi, Z Li, F Faccio, J Schmidhuber
ICLR 2025 Workshop Proposals, 0
Workshop on Reasoning and Planning for Large Language Models
Z Hu, Y Zhao, X Feng, MY Kan, N Dziri, Y Du, PW Koh, B Hooi, A Cohan
ICLR 2025 Workshop Proposals, 0
Sistem trenutno ne more izvesti postopka. Poskusite znova pozneje.
Članki 1–20