Xu-Hui Liu
Verified email at lamda.nju.edu.cn
Title
Cited by
Year
Regret minimization experience replay in off-policy reinforcement learning
XH Liu, Z Xue, J Pang, S Jiang, F Xu, Y Yu
Advances in neural information processing systems 34, 17604-17615, 2021
Cited by: 44; Year: 2021
How to guide your learner: Imitation learning with active adaptive expert involvement
XH Liu, F Xu, X Zhang, T Liu, S Jiang, R Chen, Z Zhang, Y Yu
arXiv preprint arXiv:2303.02073, 2023
Cited by: 11; Year: 2023
Hybrid value estimation for off-policy evaluation and offline reinforcement learning
XK Jin, XH Liu, S Jiang, Y Yu
arXiv preprint arXiv:2206.02000, 2022
Cited by: 7; Year: 2022
Foresight distribution adjustment for off-policy reinforcement learning
R Chen, XH Liu, TS Liu, S Jiang, F Xu, Y Yu
Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024
Cited by: 6; Year: 2024
The teaching dimension of regularized kernel learners
H Qian, XH Liu, CX Su, A Zhou, Y Yu
International Conference on Machine Learning, 17984-18002, 2022
Cited by: 5; Year: 2022
Cascaded algorithm selection with extreme-region UCB bandit
YQ Hu, XH Liu, SQ Li, Y Yu
IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (10), 6782 …, 2021
Cited by: 5; Year: 2021
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
C Jia, F Zhang, YC Li, CX Gao, XH Liu, L Yuan, Z Zhang, Y Yu
arXiv preprint arXiv:2403.07261, 2024
Cited by: 3; Year: 2024
Offline Transition Modeling via Contrastive Energy Learning
R Chen, C Jia, Z Huang, TS Liu, XH Liu, Y Yu
Forty-first International Conference on Machine Learning
Cited by: 3
Energy-guided diffusion sampling for offline-to-online reinforcement learning
XH Liu, TS Liu, S Jiang, R Chen, Z Zhang, X Chen, Y Yu
arXiv preprint arXiv:2407.12448, 2024
Year: 2024
Semantic Skill Extraction via Vision-Language Model Guidance for Efficient Reinforcement Learning
TS Liu, XH Liu, R Chen, L Jin, P Wang, Z Zhang, Y Yu
The Thirteenth International Conference on Learning Representations
Deep Demonstration Tracing: Learning Generalized Imitator for Runtime Imitation from a Single Demonstration
XH Chen, J Ye, H Zhao, YC Li, XH Liu, H Shi, YY Xu, Z Ye, SH Yang, Y Yu, ...
Forty-first International Conference on Machine Learning