Yifei Zhou
Title
Cited by
Year
Hybrid RL: Using both offline and online data can make RL efficient
Y Song, Y Zhou, A Sekhari, JA Bagnell, A Krishnamurthy, W Sun
ICLR 2023, 2022
Cited by: 86; Year: 2022
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Y Zhai, H Bai, Z Lin, J Pan, S Tong, Y Zhou, A Suhr, S Xie, Y LeCun, Y Ma, ...
arXiv preprint arXiv:2405.10292, 2024
Cited by: 37; Year: 2024
Autonomous evaluation and refinement of digital agents
J Pan, Y Zhang, N Tomlin, Y Zhou, S Levine, A Suhr
arXiv preprint arXiv:2404.06474, 2024
Cited by: 37; Year: 2024
DigiRL: Training in-the-wild device-control agents with autonomous reinforcement learning
H Bai, Y Zhou, M Cemri, J Pan, A Suhr, S Levine, A Kumar
arXiv preprint arXiv:2406.11896, 2024
Cited by: 25; Year: 2024
ArCHer: Training language model agents via hierarchical multi-turn RL
Y Zhou, A Zanette, J Pan, S Levine, A Kumar
arXiv preprint arXiv:2402.19446, 2024
Cited by: 24; Year: 2024
Test-time distribution normalization for contrastively learned visual-language models
Y Zhou, J Ren, F Li, R Zabih, SN Lim
Advances in Neural Information Processing Systems 36, 2024
Cited by: 20*; Year: 2024
Offline data enhanced on-policy policy gradient with provable guarantees
Y Zhou, A Sekhari, Y Song, W Sun
arXiv preprint arXiv:2311.08384, 2023
Cited by: 7; Year: 2023
Improve discourse dependency parsing with contextualized representations
Y Zhou, Y Feng
ACL 2022 findings, 2022
Cited by: 6; Year: 2022
BT^2: Backward-compatible Training with Basis Transformation
Y Zhou, Z Li, A Shrivastava, H Zhao, A Torralba, T Tian, SN Lim
ICCV 2023, 2022
Cited by: 5; Year: 2022
Aligning Large Language Models with Representation Editing: A Control Perspective
L Kong, H Wang, W Mu, Y Du, Y Zhuang, Y Zhou, Y Song, R Zhang, ...
arXiv preprint arXiv:2406.05954, 2024
Cited by: 3; Year: 2024
KALIE: Fine-tuning vision-language models for open-world manipulation without robot data
G Tang, S Rajkumar, Y Zhou, HR Walke, S Levine, K Fang
arXiv preprint arXiv:2409.14066, 2024
Cited by: 2; Year: 2024
Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
Y Zhou, Q Yang, K Lin, M Bai, X Zhou, YX Wang, S Levine, E Li
arXiv preprint arXiv:2412.13194, 2024
2024
Yifei Zhou
Y Zhou
University of California, Berkeley 2028, 2023
2023
GAPX: generalized autoregressive paraphrase-identification X
Y Zhou, R Li, H Housen, SN Lim
Advances in Neural Information Processing Systems 35, 2211-2225, 2022
2022