Obserwuj
Xinyang Geng
Xinyang Geng
Zweryfikowany adres z google.com - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
9662024
Real-time user-guided image colorization with learned deep priors
R Zhang, JY Zhu, P Isola, X Geng, AS Lin, T Yu, AA Efros
arXiv preprint arXiv:1705.02999, 2017
8212017
Automatic Goal Generation for Reinforcement Learning Agents
C Florensa, D Held, X Geng, P Abbeel
International Conference on Machine Learning, 1514-1523, 2018
592*2018
Open x-embodiment: Robotic learning datasets and rt-x models
A O'Neill, A Rehman, A Gupta, A Maddukuri, A Gupta, A Padalkar, A Lee, ...
arXiv preprint arXiv:2310.08864, 2023
439*2023
Koala: A dialogue model for academic research
X Geng, A Gudibande, H Liu, E Wallace, P Abbeel, S Levine, D Song
Blog post, April 1, 6, 2023
2162023
The false promise of imitating proprietary llms
A Gudibande, E Wallace, C Snell, X Geng, H Liu, P Abbeel, S Levine, ...
arXiv preprint arXiv:2305.15717, 2023
1642023
Sequential modeling enables scalable learning for large vision models
Y Bai, X Geng, K Mangalam, A Bar, AL Yuille, T Darrell, J Malik, AA Efros
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
1452024
Deep reinforcement learning for tensegrity robot locomotion
M Zhang, X Geng, J Bruce, K Caluwaerts, M Vespignani, V SunSpiral, ...
2017 IEEE international conference on robotics and automation (ICRA), 634-641, 2017
1312017
Multimodal masked autoencoders learn transferable representations
X Geng, H Liu, L Lee, D Schuurmans, S Levine, P Abbeel
arXiv preprint arXiv:2205.14204, 2022
1092022
OpenLLaMA: An open reproduction of llama
X Geng, H Liu
https://github.com/openlm-research/open_llama, 2023
1062023
Rewriting history with inverse rl: Hindsight inference for policy improvement
B Eysenbach, X Geng, S Levine, RR Salakhutdinov
Advances in neural information processing systems 33, 14783-14795, 2020
1062020
Conservative objective models for effective offline model-based optimization
B Trabucco, A Kumar, X Geng, S Levine
International Conference on Machine Learning, 10358-10368, 2021
1032021
Design-bench: Benchmarks for data-driven offline model-based optimization
B Trabucco, X Geng, A Kumar, S Levine
International Conference on Machine Learning, 21658-21676, 2022
972022
Dynamical distance learning for semi-supervised and unsupervised skill discovery
K Hartikainen, X Geng, T Haarnoja, S Levine
arXiv preprint arXiv:1907.08225, 2019
972019
Offline q-learning on diverse multi-task data both scales and generalizes
A Kumar, R Agarwal, X Geng, G Tucker, S Levine
arXiv preprint arXiv:2211.15144, 2022
502022
Meta-reinforcement learning robust to distributional shift via model identification and experience relabeling
R Mendonca, X Geng, C Finn, S Levine
arXiv preprint arXiv:2006.07178, 2020
482020
Multi-stage cable routing through hierarchical imitation learning
J Luo, C Xu, X Geng, G Feng, K Fang, L Tan, S Schaal, S Levine
IEEE Transactions on Robotics, 2024
382024
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
A Setlur, S Garg, X Geng, N Garg, V Smith, A Kumar
arXiv preprint arXiv:2406.14532, 2024
252024
Dynamical distance learning for unsupervised and semi-supervised skill discovery
K Hartikainen, X Geng, T Haarnoja, S Levine
arXiv preprint arXiv:1907.08225, 2019
202019
Rewarding progress: Scaling automated process verifiers for llm reasoning
A Setlur, C Nagpal, A Fisch, X Geng, J Eisenstein, R Agarwal, A Agarwal, ...
arXiv preprint arXiv:2410.08146, 2024
192024
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20