Wesley A Suttle

Citeras av

	Alla	Sedan 2020
Citat	137	133
h-index	4	4
i10-index	3	3

20192020202120222023202420253 11 12 24 22 59 4

Offentlig åtkomst

Visa alla

5 artiklar

1 artikel

tillgänglig

inte tillgänglig

Enligt krav från finansiärer

Medförfattare

Ji LiuStony Brook UniversityVerifierad e-postadress på stonybrook.edu
Brian M SadlerThe University of Texas at AustinVerifierad e-postadress på ieee.org
Amrit Singh BediAssistant Professor in Department of Computer Science, University of Central Florida, FL, USAVerifierad e-postadress på ucf.edu
Dinesh ManochaDistinguished University Professor, University of Maryland at College ParkVerifierad e-postadress på umd.edu
Alec KoppelResearch Lead, JP Morgan AI ResearchVerifierad e-postadress på jpmchase.com
Kaiqing ZhangAssistant Professor, University of Maryland, College ParkVerifierad e-postadress på umd.edu
Bhrij PatelCS PhD Student, University of Maryland, College ParkVerifierad e-postadress på umd.edu
Zhaoran WangAssociate Professor at Northwestern UniversityVerifierad e-postadress på northwestern.edu
Tamer BaşarSwanlund Endowed Chair Emeritus & CAS Professor Emeritus of ECE, University of IllinoisVerifierad e-postadress på illinois.edu
Vaneet AggarwalPurdue UniversityVerifierad e-postadress på purdue.edu
Krishna Chaitanya KosarajuVehicle Automation, University of ClemsonVerifierad e-postadress på clemson.edu
Sivaranjani Seetharaman (S Sivaranja...Assistant Professor, Purdue UniversityVerifierad e-postadress på purdue.edu
Vijay GuptaElectrical and Computer Engineering, Purdue UniversityVerifierad e-postadress på purdue.edu
Vipul SharmaPhD Scholar, Purdue UniversityVerifierad e-postadress på purdue.edu
Zhuoran YangYale UniversityVerifierad e-postadress på yale.edu

Följ

Wesley A Suttle

U.S. Army Research Laboratory

Verifierad e-postadress på army.mil - Startsida

reinforcement learning multi-agent systems


Titel Sortera efter citat Sortera efter år Sortera efter titel	Citeras av Citeras av	År
A multi-agent off-policy actor-critic algorithm for distributed reinforcement learning W Suttle, Z Yang, K Zhang, Z Wang, T Başar, J Liu IFAC-PapersOnLine 53 (2), 1549-1554, 2020	81	2020
Beyond exponentially fast mixing in average-reward reinforcement learning via multi-level Monte Carlo actor-critic WA Suttle, A Bedi, B Patel, BM Sadler, A Koppel, D Manocha International Conference on Machine Learning, 33240-33267, 2023	14	2023
Lancar: Leveraging language for context-aware robot locomotion in unstructured environments CL Shek, X Wu, WA Suttle, C Busart, E Zaroukian, D Manocha, P Tokekar, ... 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024	10	2024
Reinforcement learning for cost-aware Markov decision processes W Suttle, K Zhang, Z Yang, J Liu, D Kraemer International Conference on Machine Learning, 9989-9999, 2021	9	2021
Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic B Patel, WA Suttle, A Koppel, V Aggarwal, BM Sadler, AS Bedi, ... arXiv preprint arXiv:2403.11925, 2024	4	2024
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling U Singh, WA Suttle, BM Sadler, VP Namboodiri, AS Bedi arXiv preprint arXiv:2404.13423, 2024	3	2024
Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks MY Fatemi, WA Suttle, BM Sadler arXiv preprint arXiv:2402.06552, 2024	3	2024
Ada-nav: Adaptive trajectory-based sample efficient policy learning for robotic navigation B Patel, K Weerakoon, WA Suttle, A Koppel, BM Sadler, AS Bedi, ... arXiv preprint arXiv:2306.06192, 2023	3	2023
Reinforcement learning based distributed control of dissipative networked systems KC Kosaraju, S Sivaranjani, W Suttle, V Gupta, J Liu IEEE Transactions on Control of Network Systems 9 (2), 856-866, 2021	3	2021
Occupancy information ratio: Infinite-horizon, information-directed, parameterized policy search WA Suttle, A Koppel, J Liu SIAM Journal on Control and Optimization 62 (6), 3145-3171, 2024	2	2024
AIME: AI System Optimization via Multiple LLM Evaluators B Patel, S Chakraborty, WA Suttle, M Wang, AS Bedi, D Manocha arXiv preprint arXiv:2410.03131, 2024	2	2024
A Convergence Result for Regularized Actor-Critic Methods W Suttle, Z Yang, K Zhang, J Liu arXiv preprint arXiv:1907.06138, 2019	2	2019
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles B Patel, WA Suttle, A Koppel, V Aggarwal, BM Sadler, D Manocha, A Bedi Forty-first International Conference on Machine Learning, 0	1
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning WA Suttle, A Suresh, C Nieto-Granda arXiv preprint arXiv:2502.04141, 2025		2025
Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction U Singh, S Chakraborty, WA Suttle, BM Sadler, AK Sahu, M Shah, ... arXiv preprint arXiv:2411.00361, 2024		2024
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning U Singh, S Chakraborty, WA Suttle, BM Sadler, VP Namboodiri, AS Bedi arXiv preprint arXiv:2406.10892, 2024		2024
Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems W Suttle, VK Sharma, KC Kosaraju, S Seetharaman, J Liu, V Gupta, ... International Conference on Artificial Intelligence and Statistics, 4420-4428, 2024		2024
Ada-NAV: Adaptive Trajectory Length-Based Sample Efficient Policy Learning for Robotic Navigation B Patel, K Weerakoon, WA Suttle, A Koppel, BM Sadler, T Zhou, ... arXiv e-prints, arXiv: 2306.06192, 2023		2023
Information-Directed Policy Search in Sparse-Reward Settings via the Occupancy Information Ratio WA Suttle, A Koppel, J Liu 2023 57th Annual Conference on Information Sciences and Systems (CISS), 1-6, 2023		2023
Policy Gradient for Ratio Optimization: A Case Study WA Suttle, A Koppel, J Liu 2022 56th Annual Conference on Information Sciences and Systems (CISS), 281-286, 2022		2022

Systemet kan inte utföra åtgärden just nu. Försök igen senare.

Artiklar 1–20

Citat per år

Dubblettcitat

Sammanfogade citat

Lägg till medförfattareMedförfattare

Följ

Citeras av

Medförfattare