Yifei Zhou

Citirano

	Sve	Od 2020.
Citati	273	273
H-indeks	7	7
i10-indeks	6	6

200

100

150

20222023202420253 42 182 44

Suautori

Sergey LevineUC Berkeley, Physical IntelligencePotvrđena adresa e-pošte na eecs.berkeley.edu
Jiayi PanUniversity of California, BerkeleyPotvrđena adresa e-pošte na berkeley.edu
Wen SunAssistant Professor, Cornell UniversityPotvrđena adresa e-pošte na cornell.edu
Yuda SongCarnegie Mellon UniversityPotvrđena adresa e-pošte na andrew.cmu.edu
Ayush SekhariPostdoctoral Associate, MITPotvrđena adresa e-pošte na mit.edu
Aviral KumarCMU & Google DeepMindPotvrđena adresa e-pošte na andrew.cmu.edu
Sernam LimAssociate Professor, CS, University of Central FloridaPotvrđena adresa e-pošte na ucf.edu
Yuexiang ZhaiUC Berkeley | Google DeepMindPotvrđena adresa e-pošte na berkeley.edu
Andrea ZanetteAssistant Professor, Carnegie Mellon UniversityPotvrđena adresa e-pošte na andrew.cmu.edu
Zilu LiPhD Student at UCSDPotvrđena adresa e-pošte na cornell.edu

Prati

Yifei Zhou

UC Berkeley

Potvrđena adresa e-pošte na berkeley.edu - Početna stranica

Machine Learning Natural Language Processing Reinforcement Learning


Naslov Poredaj po navodima Poredaj po godini Poredaj po naslovu	Citirano Citirano	Godina
Hybrid rl: Using both offline and online data can make rl efficient Y Song, Y Zhou, A Sekhari, JA Bagnell, A Krishnamurthy, W Sun ICLR 2023, 2022	93	2022
Autonomous evaluation and refinement of digital agents J Pan, Y Zhang, N Tomlin, Y Zhou, S Levine, A Suhr arXiv preprint arXiv:2404.06474, 2024	41	2024
Fine-tuning large vision-language models as decision-making agents via reinforcement learning S Zhai, H Bai, Z Lin, J Pan, P Tong, Y Zhou, A Suhr, S Xie, Y LeCun, Y Ma, ... Advances in Neural Information Processing Systems 37, 110935-110971, 2025	38	2025
Archer: Training language model agents via hierarchical multi-turn rl Y Zhou, A Zanette, J Pan, S Levine, A Kumar arXiv preprint arXiv:2402.19446, 2024	28	2024
Digirl: Training in-the-wild device-control agents with autonomous reinforcement learning H Bai, Y Zhou, M Cemri, J Pan, A Suhr, S Levine, A Kumar arXiv preprint arXiv:2406.11896, 2024	27	2024
Test-time distribution normalization for contrastively learned visual-language models Y Zhou, J Ren, F Li, R Zabih, SN Lim Advances in Neural Information Processing Systems 36, 47105-47123, 2023	19	2023
Offline data enhanced on-policy policy gradient with provable guarantees Y Zhou, A Sekhari, Y Song, W Sun arXiv preprint arXiv:2311.08384, 2023	9	2023
Improve discourse dependency parsing with contextualized representations Y Zhou, Y Feng ACL 2022 findings, 2022	6	2022
: Backward-compatible Training with Basis Transformation Y Zhou, Z Li, A Shrivastava, H Zhao, A Torralba, T Tian, SN Lim ICCV 2023, 2022	5	2022
Kalie: Fine-tuning vision-language models for open-world manipulation without robot data G Tang, S Rajkumar, Y Zhou, HR Walke, S Levine, K Fang arXiv preprint arXiv:2409.14066, 2024	4	2024
Aligning Large Language Models with Representation Editing: A Control Perspective L Kong, H Wang, W Mu, Y Du, Y Zhuang, Y Zhou, Y Song, R Zhang, ... Advances in Neural Information Processing Systems 37, 37356-37384, 2025	3	2025
Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents Y Zhou, Q Yang, K Lin, M Bai, X Zhou, YX Wang, S Levine, E Li arXiv preprint arXiv:2412.13194, 2024		2024
Yifei Zhou Y Zhou University of California, Berkeley 2028, 2023		2023
GAPX: generalized autoregressive paraphrase-identification X Y Zhou, R Li, H Housen, SN Lim Advances in Neural Information Processing Systems 35, 2211-2225, 2022		2022

Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.

Članci 1–14

Godišnji broj citata

Dvostruki navodi

Spojeni navodi

Dodavanje suautoraSuautori

Prati

Citirano

Suautori