Adithya M Devraj

Geciteerd door

	Alles	Sinds 2020
Citaties	561	474
h-index	14	12
i10-index	19	16

120

20162017201820192020202120222023202420254 11 16 45 70 102 82 105 105 9

Openbare toegang

Alles bekijken

17 artikelen

0 artikelen

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Medeauteurs

Sean MeynProfessor of ECE and Robert C. Pittman Eminent Scholar ChairGeverifieerd e-mailadres voor ece.ufl.edu
Ana BusicInria, Computer Science Department at École normale supérieureGeverifieerd e-mailadres voor ens.fr
Ioannis KontoyiannisUniversity of CambridgeGeverifieerd e-mailadres voor cam.ac.uk
Andrey BernsteinPrincipal Researcher and Group Manager, National Renewable Energy Laboratory (NREL)Geverifieerd e-mailadres voor nrel.gov
Fan LuUniversity of FloridaGeverifieerd e-mailadres voor ufl.edu
Naren Srivaths RamanMathWorksGeverifieerd e-mailadres voor mathworks.com
Prabir BarooahProfessor, Electronics and Electrical Engineering, Indian Institute of Technology GuwahatiGeverifieerd e-mailadres voor iitg.ac.in
Yue ChenResearcher at National Renewable Energy LaboratoryGeverifieerd e-mailadres voor nrel.gov
Vivek BorkarIndian Institute of Technology BombayGeverifieerd e-mailadres voor ee.iitb.ac.in
Xu Kuang (许匡)Associate Professor, Stanford Graduate School of BusinessGeverifieerd e-mailadres voor stanford.edu
Benjamin Van RoyStanford UniversityGeverifieerd e-mailadres voor stanford.edu
Anand RadhakrishnanUniv of FloridaGeverifieerd e-mailadres voor ufl.edu
Jianshu ChenPrincipal Scientist, AmazonGeverifieerd e-mailadres voor ucla.edu
Chandra MurthyProfessor, ECE, Indian Institute of ScienceGeverifieerd e-mailadres voor iisc.ac.in
Mohit SharmaTechnology Innovation InstituteGeverifieerd e-mailadres voor tii.ae

Volgen

Adithya M Devraj

Amazon

Geverifieerd e-mailadres voor stanford.edu - Homepage

Reinforcement Learning Stochastic Approximation Stochastic Control


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
Zap Q-learning AM Devraj, S Meyn Advances in Neural Information Processing Systems 30, 2017	108	2017
Fastest convergence for Q-learning AM Devraj, SP Meyn arXiv preprint arXiv:1707.03770, 2017	48	2017
Reinforcement learning for control of building HVAC systems NS Raman, AM Devraj, P Barooah, SP Meyn 2020 American Control Conference (ACC), 2326-2332, 2020	46	2020
Explicit mean-square error bounds for monte-carlo and linear stochastic approximation S Chen, A Devraj, A Busic, S Meyn International Conference on Artificial Intelligence and Statistics, 4173-4183, 2020	41	2020
Model-free primal-dual methods for network optimization with application to real-time optimal power flow Y Chen, A Bernstein, A Devraj, S Meyn 2020 American Control Conference (ACC), 3140-3147, 2020	38	2020
The ODE method for asymptotic statistics in stochastic approximation and reinforcement learning V Borkar, S Chen, A Devraj, I Kontoyiannis, S Meyn arXiv preprint arXiv:2110.14427, 2021	26	2021
Fundamental design principles for reinforcement learning algorithms AM Devraj, A Bušić, S Meyn Handbook of Reinforcement Learning and Control, 75-137, 2021	24	2021
Zap Q-Learning with nonlinear function approximation S Chen, AM Devraj, F Lu, A Busic, S Meyn Advances in Neural Information Processing Systems 33, 16879-16890, 2020	19	2020
Stochastic variance reduced primal dual algorithms for empirical composition optimization AM Devraj, J Chen Advances in Neural Information Processing Systems 32, 2019	18	2019
Learning techniques for feedback particle filter design A Radhakrishnan, A Devraj, S Meyn 2016 IEEE 55th Conference on Decision and Control (CDC), 5453-5459, 2016	18	2016
Zap Q-Learning-a user's guide AM Devraj, A Bušić, S Meyn 2019 Fifth Indian Control Conference (ICC), 10-15, 2019	17	2019
Differential TD learning for value function approximation AM Devraj, SP Meyn Decision and Control (CDC), 2016 IEEE 55th Conference on, 6347-6354, 2016	17	2016
On matrix momentum stochastic approximation and applications to Q-learning AM Devraj, A Bušić, S Meyn 2019 57th Annual Allerton Conference on Communication, Control, and …, 2019	16*	2019
Q-learning with uniformly bounded variance AM Devraj, SP Meyn IEEE Transactions on Automatic Control 67 (11), 5948-5963, 2021	15	2021
Differential temporal difference learning AM Devraj, I Kontoyiannis, SP Meyn IEEE Transactions on Automatic Control 66 (10), 4652-4667, 2020	13	2020
Q-learning with uniformly bounded variance: Large discounting is not a barrier to fast learning AM Devraj, SP Meyn arXiv preprint arXiv:2002.10301, 2020	13	2020
Zap Q-Learning for optimal stopping S Chen, AM Devraj, A Bušić, S Meyn 2020 American Control Conference (ACC), 3920-3925, 2020	12	2020
Accelerating optimization and reinforcement learning with quasi stochastic approximation S Chen, A Devraj, A Bernstein, S Meyn 2021 American Control Conference (ACC), 1965-1972, 2021	11	2021
Power allocation in energy harvesting sensors with ARQ: A convex optimization approach AM Devraj, MK Sharma, CR Murthy 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP …, 2014	10	2014
Gaussian imagination in bandit learning Y Liu, AM Devraj, B Van Roy, K Xu arXiv preprint arXiv:2201.01902, 2022	9	2022

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs