Learning to adapt in dynamic, real-world environments through meta-reinforcement learning A Nagabandi, I Clavera, S Liu, RS Fearing, P Abbeel, S Levine, C Finn arXiv preprint arXiv:1803.11347, 2018 | 792* | 2018 |
Model-ensemble trust-region policy optimization T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel arXiv preprint arXiv:1802.10592, 2018 | 577 | 2018 |
Benchmarking model-based reinforcement learning T Wang, X Bao, I Clavera, J Hoang, Y Wen, E Langlois, S Zhang, G Zhang, ... arXiv preprint arXiv:1907.02057, 2019 | 490 | 2019 |
Model-based reinforcement learning via meta-policy optimization I Clavera, J Rothfuss, J Schulman, Y Fujita, T Asfour, P Abbeel Conference on Robot Learning, 617-629, 2018 | 313 | 2018 |
Promp: Proximal meta-policy search J Rothfuss, D Lee, I Clavera, T Asfour, P Abbeel arXiv preprint arXiv:1810.06784, 2018 | 245 | 2018 |
Model-augmented actor-critic: Backpropagating through paths I Clavera, V Fu, P Abbeel arXiv preprint arXiv:2005.08068, 2020 | 104 | 2020 |
Sub-policy adaptation for hierarchical reinforcement learning AC Li, C Florensa, I Clavera, P Abbeel arXiv preprint arXiv:1906.05862, 2019 | 100 | 2019 |
Openai o1 system card A Jaech, A Kalai, A Lerer, A Richardson, A El-Kishky, A Low, A Helyar, ... arXiv preprint arXiv:2412.16720, 2024 | 71 | 2024 |
Policy transfer via modularity and reward guiding I Clavera, D Held, P Abbeel 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2017 | 57 | 2017 |
Trajectory-wise multiple choice learning for dynamics generalization in reinforcement learning Y Seo, K Lee, I Clavera Gilaberte, T Kurutach, J Shin, P Abbeel Advances in Neural Information Processing Systems 33, 12968-12979, 2020 | 46 | 2020 |
Asynchronous methods for model-based reinforcement learning Y Zhang, I Clavera, B Tsai, P Abbeel arXiv preprint arXiv:1910.12453, 2019 | 37 | 2019 |
Mutual information maximization for robust plannable representations Y Ding, I Clavera, P Abbeel arXiv preprint arXiv:2005.08114, 2020 | 12 | 2020 |
Overcoming Model-Bias in Reinforcement Learning IC Gilaberte University of California, Berkeley, 2020 | | 2020 |
Policy Transfer Via Modularity IC Gilaberte Universitat Politècnica de Catalunya. Facultat de Matemàtiques i Estadística, 2017 | | 2017 |
Towards SLAM with an events-based camera I Clavera Gilaberte, J Solà Ortega, J Andrade-Cetto | | 2016 |
MODEL-ENSEMBLE TRUST-REGION POLICY OPTI T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel | | |
R-LAtte: Attention Module for Visual Control via Reinforcement Learning M Zhao, Q Li, A Srinivas, I Clavera, K Lee, P Abbeel | | |