Takip et
Ruijie Zheng
Ruijie Zheng
umd.edu üzerinde doğrulanmış e-posta adresine sahip
Başlık
Alıntı yapanlar
Alıntı yapanlar
Yıl
Who is the strongest enemy? towards optimal and efficient evasion attacks in deep rl
Y Sun, R Zheng, Y Liang, F Huang
ICLR 2022, 2021
782021
Efficient adversarial training without attacking: Worst-case-aware robust reinforcement learning
Y Liang, Y Sun, R Zheng, F Huang
Advances in Neural Information Processing Systems 35, 22547-22561, 2022
572022
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
R Zheng, X Wang, Y Sun, S Ma, J Zhao, H Xu, H Daumé III, F Huang
Advances in Neural Information Processing Systems 36, 2024, 2023
45*2023
Certifiably Robust Policy Learning against Adversarial Multi-Agent Communication
Y Sun, R Zheng, P Hassanzadeh, Y Liang, S Feizi, S Ganesh, F Huang
The Eleventh International Conference on Learning Representations, 2022
31*2022
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
G Xu, R Zheng, Y Liang, X Wang, Z Yuan, T Ji, Y Luo, X Liu, J Yuan, ...
The Twelfth International Conference on Learning Representations (ICLR 2024), 2023
262023
Transfer RL across observation feature spaces via model-based regularization
Y Sun, R Zheng, X Wang, A Cohen, F Huang
The Eleventh International Conference on Learning Representations, 2022
212022
Is imitation all you need? generalized decision-making with dual-phase training
Y Wei, Y Sun, R Zheng, S Vemprala, R Bonatti, S Chen, R Madaan, Z Ba, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
182023
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
R Zheng, X Wang, H Xu, F Huang
The Eleventh International Conference on Learning Representations, 2023
162023
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
R Zheng, Y Liang, X Wang, S Ma, H Daumé III, H Xu, J Langford, ...
arXiv preprint arXiv:2402.06187, 2024
62024
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL
X Wang, R Zheng, Y Sun, R Jia, W Wongkamjan, H Xu, F Huang
arXiv preprint arXiv:2310.07220, 2023
62023
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
T Ji, Y Liang, Y Zeng, Y Luo, G Xu, J Guo, R Zheng, F Huang, F Sun, H Xu
arXiv preprint arXiv:2402.14528, 2024
52024
Game-theoretic robust reinforcement learning handles temporally-coupled perturbations
Y Liang, Y Sun, R Zheng, X Liu, T Sandholm, F Huang, S McAleer
The Twelfth International Conference on Learning Representations (ICLR 2024), 2023
52023
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
R Zheng, CA Cheng, H Daumé III, F Huang, A Kolobov
Forty-first International Conference on Machine Learning, 0
5*
Tracevla: Visual trace prompting enhances spatial-temporal awareness for generalist robotic policies
R Zheng, Y Liang, S Huang, J Gao, H Daumé III, A Kolobov, F Huang, ...
arXiv preprint arXiv:2412.10345, 2024
42024
Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making
Y Xu, C Deng, Y Sun, R Zheng, X Wang, J Zhao, F Huang
arXiv preprint arXiv:2309.03426, 2023
42023
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Y Xu, C Deng, Y Sun, R Zheng, X Wang, J Zhao, F Huang
Forty-first International Conference on Machine Learning, 0
2
TREND: Tri-teaching for Robust Preference-based Reinforcement Learning with Demonstrations
S Huang, M Levy, A Gupta, D Ekpo, R Zheng, A Shrivastava
Sistem, işlemi şu anda gerçekleştiremiyor. Daha sonra yeniden deneyin.
Makaleler 1–17