A. Rupam Mahmood

Cited by

	All	Since 2020
Citations	1899	1487
h-index	20	18
i10-index	30	28

Citations per year

Year	Citations
2013	7
2014	20
2015	34
2016	57
2017	58
2018	97
2019	125
2020	182
2021	237
2022	240
2023	286
2024	444
2025	95

Public access

9 articles available, 0 articles not available (based on funding mandates)

Co-authors

Richard S. Sutton: Keen, Amii, and University of Alberta. Verified email at richsutton.com
Gautham Vasan: Amii, University of Alberta. Verified email at ualberta.ca
James Bergstra: Principal Engineer, Ocado Technology. Verified email at ocado.com
Martha White: University of Alberta. Verified email at ualberta.ca
Dmytro Korenkevych: Meta AI. Verified email at meta.com
Qingfeng Lan: PhD student @ University of Alberta. Verified email at ualberta.ca
Shibhansh Dohare: PhD Student, University of Alberta. Verified email at ualberta.ca
Hado van Hasselt: Research Scientist, DeepMind; Honorary Professor, UCL. Verified email at google.com
Patrick M. Pilarski: Professor, University of Alberta, Amii (Alberta Machine Intelligence Institute). Verified email at ualberta.ca
Brent Komer: PhD Student, University of Waterloo. Verified email at uwaterloo.ca
Harm van Seijen: Sony AI. Verified email at sony.com
Doina Precup: DeepMind and McGill University. Verified email at cs.mcgill.ca
Marlos C. Machado: University of Alberta; Amii. Verified email at ualberta.ca
Thomas Degris: DeepMind. Verified email at google.com
Fengdi Che: University of Alberta. Verified email at ualberta.ca
Oliver Limoyo: University of Toronto Institute for Aerospace Studies. Verified email at mail.utoronto.ca
Bryan Chan: University of Alberta. Verified email at ualberta.ca
Jonathan Kelly: University of Toronto Institute for Aerospace Studies. Verified email at utias.utoronto.ca
Mohamed Elsayed: PhD student @ University of Alberta. Verified email at ualberta.ca


A. Rupam Mahmood

University of Alberta, Amii

Verified email at ualberta.ca

Continual learning, reinforcement learning, robot learning, representation learning


Title	Cited by	Year
An emphatic approach to the problem of off-policy temporal-difference learning. RS Sutton, AR Mahmood, M White. (JMLR) Journal of Machine Learning Research 17, 2016	328	2016
Benchmarking reinforcement learning algorithms on real-world robots. AR Mahmood, D Korenkevych, G Vasan, W Ma, J Bergstra. (CoRL) Proceedings of the 2nd Annual Conference on Robot Learning, 2018	235	2018
Weighted importance sampling for off-policy learning with linear function approximation. AR Mahmood, H van Hasselt, RS Sutton. (NeurIPS) Advances in Neural Information Processing Systems 27, 2014	191	2014
True online temporal-difference learning. H van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton. (JMLR) Journal of Machine Learning Research 17, 2016	125	2016
Setting up a reinforcement learning task with a real-world robot. AR Mahmood, D Korenkevych, BJ Komer, J Bergstra. (IROS) 2018 IEEE/RSJ International Conference on Intelligent Robots and …, 2018	106	2018
Tuning-free step-size adaptation. AR Mahmood, RS Sutton, T Degris, PM Pilarski. (ICASSP) Acoustics, Speech and Signal Processing, 2012 IEEE International …, 2012	89	2012
Continual backprop: Stochastic gradient descent with persistent randomness. S Dohare, RS Sutton, AR Mahmood. arXiv preprint arXiv:2108.06325, 2021	79	2021
Loss of plasticity in deep continual learning. S Dohare, JF Hernandez-Garcia, Q Lan, P Rahman, AR Mahmood, ... Nature 632 (8026), 768-774, 2024	59	2024
Multi-step off-policy learning without importance sampling ratios. AR Mahmood, H Yu, RS Sutton. arXiv preprint arXiv:1702.03006, 2017	55	2017
Representation Search through Generate and Test. AR Mahmood, RS Sutton. Workshops at the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013	50	2013
Off-policy TD(λ) with a true online equivalence. H van Hasselt, AR Mahmood, RS Sutton. (UAI) Proceedings of the 30th Conference on Uncertainty in Artificial …, 2014	48	2014
On generalized Bellman equations and temporal-difference learning. H Yu, AR Mahmood, RS Sutton. (JMLR) The Journal of Machine Learning Research 19 (1), 1864-1912, 2018	42	2018
A new Q(λ) with interim forward view and Monte Carlo equivalence. RS Sutton, AR Mahmood, D Precup, H van Hasselt. (ICML) International Conference on Machine Learning, 2014	40	2014
Emphatic temporal-difference learning. AR Mahmood, H Yu, M White, RS Sutton. European Workshops on Reinforcement Learning, 2015	39	2015
Greedification operators for policy optimization: investigating forward and reverse KL divergences. A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White. (JMLR) Journal of Machine Learning Research, 2022	35	2022
Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning. M Elsayed, AR Mahmood. (ICLR) The Twelfth International Conference on Learning Representations, 2024	32*	2024
Off-policy learning based on weighted importance sampling with linear computational complexity. AR Mahmood, RS Sutton. (UAI) Proceedings of the 31st Conference on Uncertainty in Artificial …, 2015	32	2015
Maintaining plasticity in deep continual learning. S Dohare, JF Hernandez-Garcia, P Rahman, AR Mahmood, RS Sutton. arXiv preprint arXiv:2306.13812, 2023	29	2023
Autoregressive policies for continuous control deep reinforcement learning. D Korenkevych, AR Mahmood, G Vasan, J Bergstra. (IJCAI) Proceedings of the 28th International Joint Conference on Artificial …, 2019	29	2019
Incremental Off-policy Reinforcement Learning Algorithms. A Mahmood. University of Alberta, 2017	22	2017
