Yifei Zhou

Cited by

	All	Since 2020
Citations	252	252
h-index	7	7
i10-index	6	6

Citations per year: 2022: 3, 2023: 35, 2024: 183, 2025: 31

Co-authors

Sergey Levine. UC Berkeley, Physical Intelligence. Verified email at eecs.berkeley.edu
Jiayi Pan. University of California, Berkeley. Verified email at berkeley.edu
Wen Sun. Assistant Professor, Cornell University. Verified email at cornell.edu
Yuda Song. Carnegie Mellon University. Verified email at andrew.cmu.edu
Ayush Sekhari. Postdoctoral Associate, MIT. Verified email at mit.edu
Aviral Kumar. CMU & Google DeepMind. Verified email at andrew.cmu.edu
Sernam Lim. Associate Professor, CS, University of Central Florida. Verified email at ucf.edu
Yuexiang Zhai. UC Berkeley | Google DeepMind. Verified email at berkeley.edu
Andrea Zanette. Assistant Professor, Carnegie Mellon University. Verified email at andrew.cmu.edu
Zilu Li. PhD Student at UCSD. Verified email at cornell.edu


Yifei Zhou

UC Berkeley

Verified email at berkeley.edu

Machine Learning, Natural Language Processing, Reinforcement Learning


Title	Cited by	Year
Hybrid RL: Using both offline and online data can make RL efficient. Y Song, Y Zhou, A Sekhari, JA Bagnell, A Krishnamurthy, W Sun. ICLR 2023, 2022	86	2022
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning. Y Zhai, H Bai, Z Lin, J Pan, S Tong, Y Zhou, A Suhr, S Xie, Y LeCun, Y Ma, ... arXiv preprint arXiv:2405.10292, 2024	37	2024
Autonomous evaluation and refinement of digital agents. J Pan, Y Zhang, N Tomlin, Y Zhou, S Levine, A Suhr. arXiv preprint arXiv:2404.06474, 2024	37	2024
DigiRL: Training in-the-wild device-control agents with autonomous reinforcement learning. H Bai, Y Zhou, M Cemri, J Pan, A Suhr, S Levine, A Kumar. arXiv preprint arXiv:2406.11896, 2024	25	2024
ArCHer: Training language model agents via hierarchical multi-turn RL. Y Zhou, A Zanette, J Pan, S Levine, A Kumar. arXiv preprint arXiv:2402.19446, 2024	24	2024
Test-time distribution normalization for contrastively learned visual-language models. Y Zhou, J Ren, F Li, R Zabih, SN Lim. Advances in Neural Information Processing Systems 36, 2024	20*	2024
Offline data enhanced on-policy policy gradient with provable guarantees. Y Zhou, A Sekhari, Y Song, W Sun. arXiv preprint arXiv:2311.08384, 2023	7	2023
Improve discourse dependency parsing with contextualized representations. Y Zhou, Y Feng. ACL 2022 findings, 2022	6	2022
: Backward-compatible Training with Basis Transformation. Y Zhou, Z Li, A Shrivastava, H Zhao, A Torralba, T Tian, SN Lim. ICCV 2023, 2022	5	2022
Aligning Large Language Models with Representation Editing: A Control Perspective. L Kong, H Wang, W Mu, Y Du, Y Zhuang, Y Zhou, Y Song, R Zhang, ... arXiv preprint arXiv:2406.05954, 2024	3	2024
KALIE: Fine-tuning vision-language models for open-world manipulation without robot data. G Tang, S Rajkumar, Y Zhou, HR Walke, S Levine, K Fang. arXiv preprint arXiv:2409.14066, 2024	2	2024
Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents. Y Zhou, Q Yang, K Lin, M Bai, X Zhou, YX Wang, S Levine, E Li. arXiv preprint arXiv:2412.13194, 2024		2024
Yifei Zhou. Y Zhou. University of California, Berkeley 2028, 2023		2023
GAPX: Generalized autoregressive paraphrase-identification X. Y Zhou, R Li, H Housen, SN Lim. Advances in Neural Information Processing Systems 35, 2211-2225, 2022		2022


Articles 1–14
