Följ
Wesley A Suttle
Wesley A Suttle
U.S. Army Research Laboratory
Verifierad e-postadress på army.mil - Startsida
Titel
Citeras av
Citeras av
År
A multi-agent off-policy actor-critic algorithm for distributed reinforcement learning
W Suttle, Z Yang, K Zhang, Z Wang, T Başar, J Liu
IFAC-PapersOnLine 53 (2), 1549-1554, 2020
812020
Beyond exponentially fast mixing in average-reward reinforcement learning via multi-level Monte Carlo actor-critic
WA Suttle, A Bedi, B Patel, BM Sadler, A Koppel, D Manocha
International Conference on Machine Learning, 33240-33267, 2023
142023
Lancar: Leveraging language for context-aware robot locomotion in unstructured environments
CL Shek, X Wu, WA Suttle, C Busart, E Zaroukian, D Manocha, P Tokekar, ...
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024
102024
Reinforcement learning for cost-aware Markov decision processes
W Suttle, K Zhang, Z Yang, J Liu, D Kraemer
International Conference on Machine Learning, 9989-9999, 2021
92021
Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic
B Patel, WA Suttle, A Koppel, V Aggarwal, BM Sadler, AS Bedi, ...
arXiv preprint arXiv:2403.11925, 2024
42024
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
U Singh, WA Suttle, BM Sadler, VP Namboodiri, AS Bedi
arXiv preprint arXiv:2404.13423, 2024
32024
Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks
MY Fatemi, WA Suttle, BM Sadler
arXiv preprint arXiv:2402.06552, 2024
32024
Ada-nav: Adaptive trajectory-based sample efficient policy learning for robotic navigation
B Patel, K Weerakoon, WA Suttle, A Koppel, BM Sadler, AS Bedi, ...
arXiv preprint arXiv:2306.06192, 2023
32023
Reinforcement learning based distributed control of dissipative networked systems
KC Kosaraju, S Sivaranjani, W Suttle, V Gupta, J Liu
IEEE Transactions on Control of Network Systems 9 (2), 856-866, 2021
32021
Occupancy information ratio: Infinite-horizon, information-directed, parameterized policy search
WA Suttle, A Koppel, J Liu
SIAM Journal on Control and Optimization 62 (6), 3145-3171, 2024
22024
AIME: AI System Optimization via Multiple LLM Evaluators
B Patel, S Chakraborty, WA Suttle, M Wang, AS Bedi, D Manocha
arXiv preprint arXiv:2410.03131, 2024
22024
A Convergence Result for Regularized Actor-Critic Methods
W Suttle, Z Yang, K Zhang, J Liu
arXiv preprint arXiv:1907.06138, 2019
22019
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles
B Patel, WA Suttle, A Koppel, V Aggarwal, BM Sadler, D Manocha, A Bedi
Forty-first International Conference on Machine Learning, 0
1
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
WA Suttle, A Suresh, C Nieto-Granda
arXiv preprint arXiv:2502.04141, 2025
2025
Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction
U Singh, S Chakraborty, WA Suttle, BM Sadler, AK Sahu, M Shah, ...
arXiv preprint arXiv:2411.00361, 2024
2024
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
U Singh, S Chakraborty, WA Suttle, BM Sadler, VP Namboodiri, AS Bedi
arXiv preprint arXiv:2406.10892, 2024
2024
Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems
W Suttle, VK Sharma, KC Kosaraju, S Seetharaman, J Liu, V Gupta, ...
International Conference on Artificial Intelligence and Statistics, 4420-4428, 2024
2024
Ada-NAV: Adaptive Trajectory Length-Based Sample Efficient Policy Learning for Robotic Navigation
B Patel, K Weerakoon, WA Suttle, A Koppel, BM Sadler, T Zhou, ...
arXiv e-prints, arXiv: 2306.06192, 2023
2023
Information-Directed Policy Search in Sparse-Reward Settings via the Occupancy Information Ratio
WA Suttle, A Koppel, J Liu
2023 57th Annual Conference on Information Sciences and Systems (CISS), 1-6, 2023
2023
Policy Gradient for Ratio Optimization: A Case Study
WA Suttle, A Koppel, J Liu
2022 56th Annual Conference on Information Sciences and Systems (CISS), 281-286, 2022
2022
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20