Gokul Swamy
Verified email at andrew.cmu.edu
Title
Cited by
Year
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
G Swamy, S Choudhury, JA Bagnell, ZS Wu
38th International Conference on Machine Learning (ICML), 2021
84* 2021
On the Utility of Model Learning in HRI
G Swamy, J Schulz, R Choudhury, D Hadfield-Menell, A Dragan
arXiv preprint arXiv:1901.01291, 2019
68* 2019
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
G Swamy, C Dann, R Kidambi, ZS Wu, A Agarwal
arXiv preprint arXiv:2401.04056, 2024
58 2024
Scaled autonomy: Enabling human operators to control robot fleets
G Swamy, S Reddy, S Levine, AD Dragan
2020 IEEE International Conference on Robotics and Automation (ICRA), 5942-5948, 2020
50 2020
Sequence model imitation learning with unobserved contexts
G Swamy, S Choudhury, J Bagnell, SZ Wu
Advances in Neural Information Processing Systems 35, 17665-17676, 2022
29 2022
Causal imitation learning under temporally correlated noise
G Swamy, S Choudhury, D Bagnell, S Wu
International Conference on Machine Learning, 20877-20890, 2022
28 2022
Inverse Reinforcement Learning without Reinforcement Learning
G Swamy, S Choudhury, D Bagnell, S Wu
International Conference on Machine Learning, 33299-33318, 2023
25 2023
REBEL: Reinforcement Learning via Regressing Relative Rewards
Z Gao, JD Chang, W Zhan, O Oertell, G Swamy, K Brantley, T Joachims, ...
arXiv preprint arXiv:2404.16767, 2024
17 2024
Minimax Optimal Online Imitation Learning via Replay Estimation
G Swamy, N Rajaraman, M Peng, S Choudhury, J Bagnell, SZ Wu, J Jiao, ...
Advances in Neural Information Processing Systems 35, 7077-7088, 2022
17 2022
Learning Shared Safety Constraints from Multi-task Demonstrations
K Kim, G Swamy, Z Liu, D Zhao, S Choudhury, SZ Wu
Advances in Neural Information Processing Systems 36, 2024
13 2024
Hybrid Inverse Reinforcement Learning
J Ren, G Swamy, ZS Wu, JA Bagnell, S Choudhury
arXiv preprint arXiv:2402.08848, 2024
11 2024
Understanding Preference Fine-Tuning Through the Lens of Coverage
Y Song, G Swamy, A Singh, JA Bagnell, W Sun
arXiv preprint arXiv:2406.01462, 2024
9* 2024
EvIL: Evolution Strategies for Generalisable Imitation Learning
S Sapora, G Swamy, C Lu, YW Teh, JN Foerster
arXiv preprint arXiv:2406.11905, 2024
4 2024
A Critique of Strictly Batch Imitation Learning
G Swamy, S Choudhury, JA Bagnell, ZS Wu
arXiv preprint arXiv:2110.02063, 2021
3 2021
Generative Models for Pose Transfer
P Chao, A Li, G Swamy
arXiv preprint arXiv:1806.09070, 2018
3 2018
Diffusing States and Matching Scores: A New Framework for Imitation Learning
R Wu, Y Chen, G Swamy, K Brantley, W Sun
arXiv preprint arXiv:2410.13855, 2024
1 2024
Multi-Agent Imitation Learning: Value is Easy, Regret is Hard
J Tang, G Swamy, F Fang, ZS Wu
arXiv preprint arXiv:2406.04219, 2024
1 2024
Your Learned Constraint is Secretly a Backward Reachable Tube
M Qadri, G Swamy, J Francis, M Kaess, A Bajcsy
arXiv preprint arXiv:2501.15618, 2025
2025
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Z Gao, W Zhan, JD Chang, G Swamy, K Brantley, JD Lee, W Sun
arXiv preprint arXiv:2410.04612, 2024
2024
Efficient Inverse Reinforcement Learning without Compounding Errors
NE Dice, G Swamy, S Choudhury, W Sun
First Reinforcement Learning Safety Workshop, 2024
2024