Sledovať
Yang Yue(乐洋)
Yang Yue(乐洋)
Overená e-mailová adresa na: mails.tsinghua.edu.cn - Domovská stránka
Názov
Citované v
Citované v
Rok
Boosting Offline Reinforcement Learning via Data Rebalancing
Y Yue, B Kang, X Ma, Z Xu, G Huang, S Yan
NeurIPS 2022, offline RL workshop, 2022
182022
Decoupled Prioritized Resampling for Offline RL
Y Yue, B Kang, X Ma, Q Yang, G Huang, S Song, S Yan
IEEE Transactions on Neural Networks and Learning Systems, 2023
16*2023
How Far is Video Generation from World Model: A Physical Law Perspective
B Kang*, Y Yue* (Equal contribution in alphabetical order), R Lu, Z Lin, ...
arXiv preprint arXiv:2411.02385, 2024
122024
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
Y Yue*, R Lu*, B Kang*, S Song, G Huang
Neural Information Processing Systems (NeurIPS) 2023, 2023
122023
Value-consistent representation learning for data-efficient reinforcement learning
Y Yue, B Kang, Z Xu, G Huang, S Yan
Proceedings of the AAAI Conference on Artificial Intelligence 37 (9), 11069 …, 2023
112023
Improving and benchmarking offline reinforcement learning algorithms
B Kang, X Ma, Y Wang, Y Yue, S Yan
arXiv preprint arXiv:2306.00972, 2023
72023
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
H Wang*, Y Yue* (*Equal contribution), R Lu, J Shi, A Zhao, S Wang, ...
North American Chapter of the Association for Computational Linguistics 2025, 2024
52024
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Y Yue, Y Wang, B Kang, Y Han, S Wang, S Song, J Feng, G Huang
Neural Information Processing Systems (NeurIPS) 2024, 2024
42024
LLM-based Optimization of Compound AI Systems: A Survey
M Lin, J Sheng, A Zhao, S Wang, Y Yue, Y Wu, H Liu, J Liu, G Huang, ...
arXiv preprint arXiv:2410.16392, 2024
12024
Systém momentálne nemôže vykonať operáciu. Skúste to neskôr.
Články 1–9