Andrea Zanette

Cited by

	All	Since 2020
Citations	1369	1335
h-index	14	14
i10-index	15	15

Citations per year
Year	2019	2020	2021	2022	2023	2024	2025
Citations	30	103	249	296	312	340	35

Public access

8 articles available

0 articles not available

Based on funding mandates

Andrea Zanette

Assistant Professor, Carnegie Mellon University

Verified email at andrew.cmu.edu

Foundation Models, Artificial Intelligence, Machine Learning, Reinforcement Learning


Title	Authors	Venue	Cited by	Year
Tighter problem-dependent regret bounds in reinforcement learning without domain knowledge using value function bounds	A Zanette, E Brunskill	International Conference on Machine Learning, 7304-7312, 2019	322	2019
Learning near optimal policies with low inherent Bellman error	A Zanette, A Lazaric, M Kochenderfer, E Brunskill	International Conference on Machine Learning, 10978-10989, 2020	259	2020
Frequentist regret bounds for randomized least-squares value iteration	A Zanette, D Brandfonbrener, E Brunskill, M Pirotta, A Lazaric	International Conference on Artificial Intelligence and Statistics, 1954-1964, 2020	156	2020
Provable benefits of actor-critic methods for offline reinforcement learning	A Zanette, MJ Wainwright, E Brunskill	Advances in Neural Information Processing Systems 34, 13626-13640, 2021	145	2021
Exponential lower bounds for batch reinforcement learning: Batch RL can be exponentially harder than online RL	A Zanette	International Conference on Machine Learning, 12287-12297, 2021	90	2021
Provably efficient reward-agnostic navigation with linear value iteration	A Zanette, A Lazaric, MJ Kochenderfer, E Brunskill	Advances in Neural Information Processing Systems 33, 11756-11766, 2020	70	2020
Cautiously optimistic policy optimization and exploration with linear function approximation	A Zanette, CA Cheng, A Agarwal	Conference on Learning Theory, 4473-4525, 2021	62	2021
Almost horizon-free structure-aware best policy identification with a generative model	A Zanette, MJ Kochenderfer, E Brunskill	Advances in Neural Information Processing Systems 32, 2019	41	2019
Limiting extrapolation in linear approximate value iteration	A Zanette, A Lazaric, MJ Kochenderfer, E Brunskill	Advances in Neural Information Processing Systems 32, 2019	40	2019
Robust super-level set estimation using Gaussian processes	A Zanette, J Zhang, MJ Kochenderfer	Joint European Conference on Machine Learning and Knowledge Discovery in …, 2018	40	2018
Design of experiments for stochastic contextual linear bandits	A Zanette, K Dong, JN Lee, E Brunskill	Advances in Neural Information Processing Systems 34, 22720-22731, 2021	31	2021
ArCHer: Training language model agents via hierarchical multi-turn RL	Y Zhou, A Zanette, J Pan, S Levine, A Kumar	arXiv preprint arXiv:2402.19446, 2024	27	2024
Problem dependent reinforcement learning bounds which can identify bandit structure in MDPs	A Zanette, E Brunskill	International Conference on Machine Learning, 5747-5755, 2018	24	2018
When is realizability sufficient for off-policy reinforcement learning?	A Zanette	International Conference on Machine Learning, 40637-40668, 2023	19	2023
Bellman residual orthogonalization for offline reinforcement learning	A Zanette, MJ Wainwright	Advances in Neural Information Processing Systems 35, 3137-3151, 2022	11	2022
Policy finetuning in reinforcement learning via design of experiments using offline data	R Zhang, A Zanette	Advances in Neural Information Processing Systems 36, 2024	8	2024
Information directed reinforcement learning	A Zanette, R Sarkar	Technical report, 2017	7	2017
Stabilizing Q-learning with linear architectures for provable efficient learning	A Zanette, M Wainwright	International Conference on Machine Learning, 25920-25954, 2022	6	2022
Accelerating Best-of-N via Speculative Rejection	R Zhang, M Haider, M Yin, J Qiu, M Wang, P Bartlett, A Zanette	2nd Workshop on Advancing Neural Network Training: Computational Efficiency …	6
Fast best-of-N decoding via speculative rejection	H Sun, M Haider, R Zhang, H Yang, J Qiu, M Yin, M Wang, P Bartlett, ...	arXiv preprint arXiv:2410.20290, 2024	3	2024
