Xidong Feng
Title
Cited by
Year
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ...
NeurIPS 2022, 2022
Cited by: 104, Year: 2022
AlphaZero-like tree-search can guide large language model decoding and training
X Feng, Z Wan, M Wen, SM McAleer, Y Wen, W Zhang, J Wang
Forty-first International Conference on Machine Learning, 2024
Cited by: 86, Year: 2024
Vehicle trajectory prediction using intention-based conditional variational autoencoder
X Feng, Z Cen, J Hu, Y Zhang
2019 IEEE Intelligent Transportation Systems Conference (ITSC), 3514-3519, 2019
Cited by: 67, Year: 2019
Heterogeneous-agent mirror learning: A continuum of solutions to cooperative MARL
JG Kuba*, X Feng*, S Ding, H Dong, J Wang, Y Yang
JMLR, 2022
Cited by: 63*, Year: 2022
Towards effective context for meta-reinforcement learning: an approach based on contrastive learning
H Fu, H Tang, J Hao, C Chen, X Feng, D Li, W Liu
Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7457-7465, 2021
Cited by: 53, Year: 2021
Neural Auto-Curricula
X Feng*, O Slumbers*, Y Yang, Z Wan, B Liu, S McAleer, Y Wen, J Wang
NeurIPS 2021, 2021
Cited by: 52*, Year: 2021
ChessGPT: Bridging Policy Learning and Language Modeling
X Feng, Y Luo, Z Wang, H Tang, M Yang, K Shao, D Mguni, Y Du, J Wang
Advances in Neural Information Processing Systems 36, 2024
Cited by: 41, Year: 2024
MRI reconstruction with interpretable pixel-wise operations using reinforcement learning
W Li*, X Feng*, H An, XY Ng, YJ Zhang
Proceedings of the AAAI conference on artificial intelligence 34 (01), 792-799, 2020
Cited by: 35, Year: 2020
CMML: Contextual modulation meta learning for cold-start recommendation
X Feng, C Chen, D Li, M Zhao, J Hao, J Wang
Proceedings of the 30th ACM International Conference on Information …, 2021
Cited by: 31, Year: 2021
Pangu-agent: A fine-tunable generalist agent with structured reasoning
F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, ...
arXiv preprint arXiv:2312.14878, 2023
Cited by: 17, Year: 2023
TorchOpt: An efficient library for differentiable optimization
J Ren*, X Feng*, B Liu*, X Pan*, Y Fu, L Mai, Y Yang
JMLR Open Source Software, 2022
Cited by: 13, Year: 2022
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
X Feng*, B Liu*, J Ren, L Mai, R Zhu, J Wang, Y Yang
NeurIPS 2022, 2021
Cited by: 13*, Year: 2021
Contextual Transformer for Offline Meta Reinforcement Learning
R Lin, Y Li, X Feng, Z Zhang, XHW Fung, H Zhang, J Wang, Y Du, Y Yang
NeurIPS 2022 FMDM workshop, 2022
Cited by: 11, Year: 2022
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in LLMs
Z Hu, C Liu, X Feng, Y Zhao, SK Ng, AT Luu, J He, PW Koh, B Hooi
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
Cited by: 10*, Year: 2024
Autonomous lane change decision making using different deep reinforcement learning methods
X Feng, J Hu, Y Huo, Y Zhang
CICTP 2019, 5563-5575, 2019
Cited by: 10, Year: 2019
MANSA: Learning fast and slow in multi-agent systems
DH Mguni, H Chen, T Jafferjee, J Wang, L Yue, X Feng, SM McAleer, ...
International Conference on Machine Learning, 24631-24658, 2023
Cited by: 6, Year: 2023
Natural language reinforcement learning
X Feng, Z Wan, H Fu, B Liu, M Yang, GA Koushik, Z Hu, Y Wen, J Wang
arXiv preprint arXiv:2411.14251, 2024
Cited by: 2, Year: 2024
Efficient Reinforcement Learning with Large Language Model Priors
X Yan, Y Song, X Feng, M Yang, H Zhang, HB Ammar, J Wang
arXiv preprint arXiv:2410.07927, 2024
Cited by: 2, Year: 2024
World Models: Understanding, Modelling and Scaling
M Yang, H Li, F Laakom, X Feng, J Shi, Z Li, F Faccio, J Schmidhuber
ICLR 2025 Workshop Proposals
Workshop on Reasoning and Planning for Large Language Models
Z Hu, Y Zhao, X Feng, MY Kan, N Dziri, Y Du, PW Koh, B Hooi, A Cohan
ICLR 2025 Workshop Proposals