Prashanth L.A.

Cited by

	All	Since 2020
Citations	2441	1483
h-index	20	19
i10-index	33	28

Citations per year

Year	Citations
2011	9
2012	15
2013	50
2014	64
2015	82
2016	98
2017	76
2018	128
2019	180
2020	208
2021	264
2022	291
2023	387
2024	290
2025	42

Public access

20 articles available, 0 articles not available

Based on funding mandates

Co-authors

Shalabh Bhatnagar, Professor in the Department of Computer Science and Automation, Indian Institute of Science (verified email at iisc.ac.in)
Michael C. Fu, University of Maryland (verified email at umd.edu)
Mohammad Ghavamzadeh, Amazon AGI (verified email at amazon.com)
Krishna Jagannathan, Professor, Department of Electrical Engineering, IIT Madras (verified email at ee.iitm.ac.in)
H L Prasad, Chairman and CTO at Astrome Technologies (verified email at csa.iisc.ernet.in)
Rémi Munos, FAIR, Meta (verified email at inria.fr)
Ravi Kumar Kolla, IIT Madras (verified email at ee.iitm.ac.in)
Csaba Szepesvari, DeepMind & University of Alberta (verified email at cs.ualberta.ca)
Sanjay P. Bhat, Tata Consultancy Services Limited (verified email at tcs.com)
Cheng Jie, Pinterest LLC, University of Maryland, College Park, Walmart Global Tech (verified email at pinterest.com)
Nirmit Desai, IBM Research (verified email at us.ibm.com)
Nirav Bhavsar, M.S. Scholar in the Department of Computer Science and Engineering, Indian Institute of Technology (verified email at cse.iitm.ac.in)
Nithia Vijayan, Research Fellow, School of Computing, National University of Singapore (verified email at comp.nus.edu.sg)
Doina Precup, DeepMind and McGill University (verified email at cs.mcgill.ca)
Aditya Gopalan, Indian Institute of Science, Bangalore (verified email at iisc.ac.in)
Gandharv Patil, McGill University, Mila (verified email at mail.mcgill.ca)
Dheeraj Nagaraj, Research Scientist, Google (verified email at google.com)
Gargi Dasgupta, IBM Research Lab (verified email at in.ibm.com)
Steven I. Marcus, Professor of Electrical and Computer Engineering, University of Maryland (verified email at umd.edu)
Andras Gyorgy, DeepMind (verified email at google.com)

Prashanth L.A.

Associate Professor, Department of Computer Science and Engg., IIT Madras

Verified email at cse.iitm.ac.in

Reinforcement learning, simulation optimization, multi-armed bandits


Title	Authors	Venue	Cited by	Year
Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods	S Bhatnagar, HL Prasad, LA Prashanth	Springer 434, 302, 2013	440*	2013
Reinforcement Learning With Function Approximation for Traffic Signal Control	P LA, S Bhatnagar	Intelligent Transportation Systems, IEEE Transactions on, 1-10, 2011	402	2011
Actor-critic algorithms for risk-sensitive MDPs	P La, M Ghavamzadeh	Advances in neural information processing systems 26, 2013	338	2013
Cumulative prospect theory meets reinforcement learning: Prediction and control	LA Prashanth, C Jie, M Fu, S Marcus, C Szepesvári	International Conference on Machine Learning, 1406-1415, 2016	97	2016
Reinforcement learning with average cost for adaptive control of traffic lights at intersections	LA Prashanth, S Bhatnagar	2011 14th International IEEE Conference on Intelligent Transportation …, 2011	96	2011
Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs	LA Prashanth, M Ghavamzadeh	arXiv preprint arXiv:1403.6530, 2014	89	2014
Policy gradients for CVaR-constrained MDPs	LA Prashanth	International Conference on Algorithmic Learning Theory, 155-169, 2014	77	2014
Two-timescale algorithms for learning Nash equilibria in general-sum stochastic games	HL Prasad, P LA, S Bhatnagar	Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	75	2015
Concentration bounds for empirical conditional value-at-risk: The unbounded case	RK Kolla, LA Prashanth, SP Bhat, K Jagannathan	Operations Research Letters 47 (1), 16-20, 2019	63	2019
On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence	N Korda, P La	International conference on machine learning, 626-634, 2015	62	2015
Stochastic optimization in a cumulative prospect theory framework	C Jie, LA Prashanth, M Fu, S Marcus, C Szepesvári	IEEE Transactions on Automatic Control 63 (9), 2867-2882, 2018	59	2018
Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions	LA Prashanth, K Jagannathan, RK Kolla	Proceedings of the 37th International Conference on Machine Learning, 5577-5586, 2020	58	2020
Concentration of risk measures: A Wasserstein distance approach	SP Bhat, P LA	Advances in neural information processing systems 32, 2019	58	2019
Threshold tuning using stochastic optimization for graded signal control	LA Prashanth, S Bhatnagar	IEEE Transactions on Vehicular Technology 61 (9), 3865-3880, 2012	54	2012
Risk-sensitive reinforcement learning via policy gradient search	LA Prashanth, MC Fu	Foundations and Trends® in Machine Learning 15 (5), 537-693, 2022	40	2022
Adaptive system optimization using random directions stochastic approximation	LA Prashanth, S Bhatnagar, M Fu, S Marcus	IEEE Transactions on Automatic Control 62 (5), 2223-2238, 2017	39	2017
Risk-sensitive reinforcement learning: A constrained optimization viewpoint	LA Prashanth, M Fu	arXiv 2018, 2018	36	2018
Analysis of stochastic approximation for efficient least squares regression and LSTD	LA Prashanth, N Korda, R Munos	arXiv preprint arXiv:1306.2557, 2013	28*	2013
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation	G Patil, LA Prashanth, D Nagaraj, D Precup	International Conference on Artificial Intelligence and Statistics, 5438-5448, 2023	20	2023
A Wasserstein distance approach for concentration of empirical risk estimates	LA Prashanth, SP Bhat	Journal of Machine Learning Research 23 (238), 1-61, 2022	20	2022
