Tianshou: A highly modularized deep reinforcement learning library J Weng*, H Chen*, D Yan, K You, A Duburcq, M Zhang, H Su, J Zhu Journal of Machine Learning Research 23 (267), 2022 | 245 | 2022 |
Offline reinforcement learning via high-fidelity generative behavior modeling H Chen, C Lu, C Ying, H Su, J Zhu The Eleventh International Conference on Learning Representations (ICLR), 2022 | 90 | 2022 |
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning C Lu*, H Chen*, J Chen, H Su, C Li, J Zhu International Conference on Machine Learning (ICML), 2023 | 53 | 2023 |
Noise contrastive alignment of language models with explicit rewards H Chen, G He, H Su, J Zhu The Annual Conference on Neural Information Processing Systems (NeurIPS), 2024 | 18 | 2024 |
Rdt-1b: a diffusion foundation model for bimanual manipulation S Liu, L Wu, B Li, H Tan, H Chen, Z Wang, K Xu, H Su, J Zhu International Conference on Learning Representations (ICLR), 2024 | 13 | 2024 |
Score Regularized Policy Optimization through Diffusion Behavior H Chen, C Lu, Z Wang, H Su, J Zhu The Twelfth International Conference on Learning Representations (ICLR), 2023 | 12 | 2023 |
Free process rewards without process labels L Yuan, W Li, H Chen, G Cui, N Ding, K Zhang, B Zhou, Z Liu, H Peng arXiv preprint arXiv:2412.01981, 2024 | 2 | 2024 |
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory T Luo, T Pearce, H Chen, J Chen, J Zhu The Annual Conference on Neural Information Processing Systems (NeurIPS), 2024 | 2 | 2024 |
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment H Chen, H Su, P Sun, J Zhu International Conference on Learning Representations (ICLR), 2024 | 1 | 2024 |
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control H Chen, K Zheng, H Su, J Zhu The Annual Conference on Neural Information Processing Systems (NeurIPS), 2024 | | 2024 |
Converging and Stabilizing Generative Adversarial Imitation Learning T Luo, J Chen, H Chen, J Zhu | | |