Yifei Zhou
Title
Cited by
Year
Hybrid RL: Using both offline and online data can make RL efficient
Y Song, Y Zhou, A Sekhari, JA Bagnell, A Krishnamurthy, W Sun
ICLR 2023, 2022
Cited by 93, year 2022
Autonomous evaluation and refinement of digital agents
J Pan, Y Zhang, N Tomlin, Y Zhou, S Levine, A Suhr
arXiv preprint arXiv:2404.06474, 2024
Cited by 41, year 2024
Fine-tuning large vision-language models as decision-making agents via reinforcement learning
S Zhai, H Bai, Z Lin, J Pan, P Tong, Y Zhou, A Suhr, S Xie, Y LeCun, Y Ma, ...
Advances in Neural Information Processing Systems 37, 110935-110971, 2025
Cited by 38, year 2025
ArCHer: Training language model agents via hierarchical multi-turn RL
Y Zhou, A Zanette, J Pan, S Levine, A Kumar
arXiv preprint arXiv:2402.19446, 2024
Cited by 28, year 2024
DigiRL: Training in-the-wild device-control agents with autonomous reinforcement learning
H Bai, Y Zhou, M Cemri, J Pan, A Suhr, S Levine, A Kumar
arXiv preprint arXiv:2406.11896, 2024
Cited by 27, year 2024
Test-time distribution normalization for contrastively learned visual-language models
Y Zhou, J Ren, F Li, R Zabih, SN Lim
Advances in Neural Information Processing Systems 36, 47105-47123, 2023
Cited by 19, year 2023
Offline data enhanced on-policy policy gradient with provable guarantees
Y Zhou, A Sekhari, Y Song, W Sun
arXiv preprint arXiv:2311.08384, 2023
Cited by 9, year 2023
Improve discourse dependency parsing with contextualized representations
Y Zhou, Y Feng
ACL 2022 findings, 2022
Cited by 6, year 2022
: Backward-compatible Training with Basis Transformation
Y Zhou, Z Li, A Shrivastava, H Zhao, A Torralba, T Tian, SN Lim
ICCV 2023, 2022
Cited by 5, year 2022
KALIE: Fine-tuning vision-language models for open-world manipulation without robot data
G Tang, S Rajkumar, Y Zhou, HR Walke, S Levine, K Fang
arXiv preprint arXiv:2409.14066, 2024
Cited by 4, year 2024
Aligning Large Language Models with Representation Editing: A Control Perspective
L Kong, H Wang, W Mu, Y Du, Y Zhuang, Y Zhou, Y Song, R Zhang, ...
Advances in Neural Information Processing Systems 37, 37356-37384, 2025
Cited by 3, year 2025
Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
Y Zhou, Q Yang, K Lin, M Bai, X Zhou, YX Wang, S Levine, E Li
arXiv preprint arXiv:2412.13194, 2024
2024
Yifei Zhou
Y Zhou
University of California, Berkeley 2028, 2023
2023
GAPX: generalized autoregressive paraphrase-identification X
Y Zhou, R Li, H Housen, SN Lim
Advances in Neural Information Processing Systems 35, 2211-2225, 2022
2022
Articles 1–14