Volgen
Xinyang Geng
Xinyang Geng
Geverifieerd e-mailadres voor google.com - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
11202024
Real-time user-guided image colorization with learned deep priors
R Zhang, JY Zhu, P Isola, X Geng, AS Lin, T Yu, AA Efros
arXiv preprint arXiv:1705.02999, 2017
8252017
Automatic Goal Generation for Reinforcement Learning Agents
C Florensa, D Held, X Geng, P Abbeel
International Conference on Machine Learning, 1514-1523, 2018
593*2018
Open x-embodiment: Robotic learning datasets and rt-x models
JJ Lim
IEEE International Conference on Robotics and Automation, 2024
463*2024
Koala: A dialogue model for academic research
X Geng, A Gudibande, H Liu, E Wallace, P Abbeel, S Levine, D Song
Blog post, April 1 (6), 2023
2262023
The false promise of imitating proprietary llms
A Gudibande, E Wallace, C Snell, X Geng, H Liu, P Abbeel, S Levine, ...
arXiv preprint arXiv:2305.15717, 2023
1682023
OpenLLaMA: An open reproduction of llama
X Geng, H Liu
https://github.com/openlm-research/open_llama, 2023
1622023
Sequential modeling enables scalable learning for large vision models
Y Bai, X Geng, K Mangalam, A Bar, AL Yuille, T Darrell, J Malik, AA Efros
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
1552024
Deep reinforcement learning for tensegrity robot locomotion
M Zhang, X Geng, J Bruce, K Caluwaerts, M Vespignani, V SunSpiral, ...
2017 IEEE international conference on robotics and automation (ICRA), 634-641, 2017
1312017
Multimodal masked autoencoders learn transferable representations
X Geng, H Liu, L Lee, D Schuurmans, S Levine, P Abbeel
arXiv preprint arXiv:2205.14204, 2022
1092022
Rewriting history with inverse rl: Hindsight inference for policy improvement
B Eysenbach, X Geng, S Levine, RR Salakhutdinov
Advances in neural information processing systems 33, 14783-14795, 2020
1092020
Conservative objective models for effective offline model-based optimization
B Trabucco, A Kumar, X Geng, S Levine
International Conference on Machine Learning, 10358-10368, 2021
1022021
Design-bench: Benchmarks for data-driven offline model-based optimization
B Trabucco, X Geng, A Kumar, S Levine
International Conference on Machine Learning, 21658-21676, 2022
1002022
Dynamical distance learning for semi-supervised and unsupervised skill discovery
K Hartikainen, X Geng, T Haarnoja, S Levine
arXiv preprint arXiv:1907.08225, 2019
982019
Offline q-learning on diverse multi-task data both scales and generalizes
A Kumar, R Agarwal, X Geng, G Tucker, S Levine
arXiv preprint arXiv:2211.15144, 2022
522022
Meta-reinforcement learning robust to distributional shift via model identification and experience relabeling
R Mendonca, X Geng, C Finn, S Levine
arXiv preprint arXiv:2006.07178, 2020
492020
Multistage cable routing through hierarchical imitation learning
J Luo, C Xu, X Geng, G Feng, K Fang, L Tan, S Schaal, S Levine
IEEE Transactions on Robotics 40, 1476-1491, 2024
372024
Rl on incorrect synthetic data scales the efficiency of llm math reasoning by eight-fold
A Setlur, S Garg, X Geng, N Garg, V Smith, A Kumar
Advances in Neural Information Processing Systems 37, 43000-43031, 2025
292025
Rewarding progress: Scaling automated process verifiers for llm reasoning
A Setlur, C Nagpal, A Fisch, X Geng, J Eisenstein, R Agarwal, A Agarwal, ...
arXiv preprint arXiv:2410.08146, 2024
262024
Action-quantized offline reinforcement learning for robotic skill learning
J Luo, P Dong, J Wu, A Kumar, X Geng, S Levine
Conference on Robot Learning, 1348-1361, 2023
202023
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20