Prashanth L.A.

Cited by

	All	Since 2019
Citations	2216	1432
h-index	18	17
i10-index	29	26

Citations per year

Year	Citations
2011	9
2012	15
2013	49
2014	63
2015	80
2016	99
2017	76
2018	127
2019	177
2020	201
2021	273
2022	284
2023	422
2024	73

Public access (based on funding mandates)

18 articles available
0 articles not available

Co-authors

Shalabh Bhatnagar - Professor in the Department of Computer Science and Automation, Indian Institute of Science - Verified email at iisc.ac.in
Michael C. Fu - University of Maryland - Verified email at umd.edu
Mohammad Ghavamzadeh - Amazon - Verified email at amazon.com
H L Prasad - Chairman and CTO at Astrome Technologies - Verified email at csa.iisc.ernet.in
Krishna Jagannathan - Associate Professor, Department of Electrical Engineering, IIT Madras - Verified email at ee.iitm.ac.in
Rémi Munos - DeepMind - Verified email at inria.fr
Csaba Szepesvari - DeepMind & University of Alberta - Verified email at cs.ualberta.ca
Sanjay P. Bhat - Tata Consultancy Services Limited - Verified email at tcs.com
Ravi Kumar Kolla - IIT Madras - Verified email at ee.iitm.ac.in
Cheng Jie - Pinterest LLC, University of Maryland, College Park, Walmart Global Tech - Verified email at pinterest.com
Nirmit Desai - IBM Research - Verified email at us.ibm.com
Nirav Bhavsar - M.S. Scholar in the Department of Computer Science and Engineering, Indian Institute of Technology - Verified email at cse.iitm.ac.in
Aditya Gopalan - Indian Institute of Science, Bangalore - Verified email at iisc.ac.in
Doina Precup - DeepMind and McGill University - Verified email at cs.mcgill.ca
Gargi Dasgupta - IBM Research Lab - Verified email at in.ibm.com
Gandharv Patil - McGill University, Mila - Verified email at mail.mcgill.ca
Dheeraj Nagaraj - Research Scientist, Google - Verified email at google.com
Nithia Vijayan - Research Scholar, Department of Computer Science and Engineering, Indian Institute of Technology - Verified email at cse.iitm.ac.in
Steven I. Marcus - Professor of Electrical and Computer Engineering, University of Maryland - Verified email at umd.edu
Andras Gyorgy - DeepMind - Verified email at google.com

Prashanth L.A.

Associate Professor, Department of Computer Science and Engineering, IIT Madras

Verified email at cse.iitm.ac.in

Reinforcement learning, simulation optimization, multi-armed bandits


Title	Authors	Venue	Cited by	Year
Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods	S Bhatnagar, HL Prasad, LA Prashanth	Springer 434, 302, 2013	447*	2013
Reinforcement Learning With Function Approximation for Traffic Signal Control	P LA, S Bhatnagar	Intelligent Transportation Systems, IEEE Transactions on, 1-10, 2011	372	2011
Actor-critic algorithms for risk-sensitive MDPs	P La, M Ghavamzadeh	Advances in neural information processing systems 26, 2013	336	2013
Cumulative prospect theory meets reinforcement learning: Prediction and control	LA Prashanth, C Jie, M Fu, S Marcus, C Szepesvári	International Conference on Machine Learning, 1406-1415, 2016	85	2016
Reinforcement learning with average cost for adaptive control of traffic lights at intersections	LA Prashanth, S Bhatnagar	2011 14th International IEEE Conference on Intelligent Transportation …, 2011	84	2011
Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs	LA Prashanth, M Ghavamzadeh	arXiv preprint arXiv:1403.6530, 2014	78	2014
Two-timescale algorithms for learning Nash equilibria in general-sum stochastic games	HL Prasad, P LA, S Bhatnagar	Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	68	2015
Policy gradients for CVaR-constrained MDPs	LA Prashanth	International Conference on Algorithmic Learning Theory, 155-169, 2014	66	2014
Concentration bounds for empirical conditional value-at-risk: The unbounded case	RK Kolla, LA Prashanth, SP Bhat, K Jagannathan	Operations Research Letters 47 (1), 16-20, 2019	54	2019
Threshold tuning using stochastic optimization for graded signal control	LA Prashanth, S Bhatnagar	IEEE Transactions on Vehicular Technology 61 (9), 3865-3880, 2012	52	2012
Concentration of risk measures: A Wasserstein distance approach	SP Bhat, P LA	Advances in neural information processing systems 32, 2019	50	2019
On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence	N Korda, P La	International Conference on Machine Learning, 626-634, 2015	50	2015
Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions	LA Prashanth, K Jagannathan, RK Kolla	Proceedings of the 37th International Conference on Machine Learning, 5577-5586, 2020	47	2020
Stochastic optimization in a cumulative prospect theory framework	C Jie, LA Prashanth, M Fu, S Marcus, C Szepesvári	IEEE Transactions on Automatic Control 63 (9), 2867-2882, 2018	47	2018
Risk-sensitive reinforcement learning: A constrained optimization viewpoint	LA Prashanth, M Fu	arXiv 2018, 2018	35	2018
Adaptive system optimization using random directions stochastic approximation	LA Prashanth, S Bhatnagar, M Fu, S Marcus	IEEE Transactions on Automatic Control 62 (5), 2223-2238, 2017	34	2017
Risk-sensitive reinforcement learning via policy gradient search	LA Prashanth, MC Fu	Foundations and Trends® in Machine Learning 15 (5), 537-693, 2022	25	2022
Analysis of stochastic approximation for efficient least squares regression and LSTD	LA Prashanth, N Korda, R Munos	arXiv preprint arXiv:1306.2557, 2013	23*	2013
Risk-aware multi-armed bandits using conditional value-at-risk	RK Kolla, LA Prashanth, K Jagannathan	arXiv preprint arXiv:1901.00997, 2019	17	2019
(Bandit) Convex Optimization with Biased Noisy Gradient Oracles	X Hu, LA Prashanth, A György, C Szepesvári	International Conference on Artificial Intelligence and Statistics (AISTATS …, 2016	17	2016

* Cited-by count includes merged citations.