Follow
Huayu Chen
Title
Cited by
Cited by
Year
Tianshou: A highly modularized deep reinforcement learning library
J Weng*, H Chen*, D Yan, K You, A Duburcq, M Zhang, H Su, J Zhu
Journal of Machine Learning Research 23 (267), 2022
2452022
Offline reinforcement learning via high-fidelity generative behavior modeling
H Chen, C Lu, C Ying, H Su, J Zhu
The Eleventh International Conference on Learning Representations (ICLR), 2022
902022
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
C Lu*, H Chen*, J Chen, H Su, C Li, J Zhu
International Conference on Machine Learning (ICML), 2023
532023
Noise contrastive alignment of language models with explicit rewards
H Chen, G He, H Su, J Zhu
The Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
182024
Rdt-1b: a diffusion foundation model for bimanual manipulation
S Liu, L Wu, B Li, H Tan, H Chen, Z Wang, K Xu, H Su, J Zhu
International Conference on Learning Representations (ICLR), 2024
132024
Score Regularized Policy Optimization through Diffusion Behavior
H Chen, C Lu, Z Wang, H Su, J Zhu
The Twelfth International Conference on Learning Representations (ICLR), 2023
122023
Free process rewards without process labels
L Yuan, W Li, H Chen, G Cui, N Ding, K Zhang, B Zhou, Z Liu, H Peng
arXiv preprint arXiv:2412.01981, 2024
22024
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory
T Luo, T Pearce, H Chen, J Chen, J Zhu
The Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
22024
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
H Chen, H Su, P Sun, J Zhu
International Conference on Learning Representations (ICLR), 2024
12024
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
H Chen, K Zheng, H Su, J Zhu
The Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
2024
Converging and Stabilizing Generative Adversarial Imitation Learning
T Luo, J Chen, H Chen, J Zhu
The system can't perform the operation now. Try again later.
Articles 1–11