Mridul Agarwal
Amazon
Verified email at purdue.edu
Title
Cited by
Year
Achieving zero constraint violation for constrained reinforcement learning via primal-dual approach
Q Bai, AS Bedi, M Agarwal, A Koppel, V Aggarwal
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3682-3689, 2022
Cited by: 72; Year: 2022
Multi-agent multi-armed bandits with limited communication
M Agarwal, V Aggarwal, K Azizzadenesheli
Journal of Machine Learning Research 23 (212), 1-24, 2022
Cited by: 47; Year: 2022
On the approximation of cooperative heterogeneous multi-agent reinforcement learning (MARL) using mean field control (MFC)
WU Mondal, M Agarwal, V Aggarwal, SV Ukkusuri
Journal of Machine Learning Research 23 (129), 1-46, 2022
Cited by: 43; Year: 2022
Multi-objective reinforcement learning with non-linear scalarization
M Agarwal, V Aggarwal, T Lan
Proceedings of the 21st International Conference on Autonomous Agents and …, 2022
Cited by: 28; Year: 2022
Stochastic Top K-Subset Bandits with Linear Space and Non-Linear Feedback with Applications to Social Influence Maximization
M Agarwal, V Aggarwal, AK Umrawal, CJ Quinn
ACM/IMS Transactions on Data Science (TDS) 2 (4), 1-39, 2022
Cited by: 28*; Year: 2022
Blind decision making: Reinforcement learning with delayed observations
M Agarwal, V Aggarwal
Pattern Recognition Letters 150, 176-182, 2021
Cited by: 27; Year: 2021
DESERTS: Delay-tolerant semi-autonomous robot teleoperation for surgery
G Gonzalez, M Agarwal, MV Balakuntala, MM Rahman, U Kaur, ...
2021 IEEE International Conference on Robotics and Automation (ICRA), 12693 …, 2021
Cited by: 27; Year: 2021
Regret guarantees for model-based reinforcement learning with long-term average constraints
M Agarwal, Q Bai, V Aggarwal
Uncertainty in Artificial Intelligence, 22-31, 2022
Cited by: 24*; Year: 2022
Transferring dexterous surgical skill knowledge between robots for semi-autonomous teleoperation
MM Rahman, N Sanchez-Tamayo, G Gonzalez, M Agarwal, V Aggarwal, ...
2019 28th IEEE International Conference on Robot and Human Interactive …, 2019
Cited by: 24; Year: 2019
An explore-then-commit algorithm for submodular maximization under full-bandit feedback
G Nie, M Agarwal, AK Umrawal, V Aggarwal, CJ Quinn
Uncertainty in Artificial Intelligence, 1541-1551, 2022
Cited by: 21; Year: 2022
ASAP: A semi-autonomous precise system for telesurgery during communication delays
G Gonzalez, M Balakuntala, M Agarwal, T Low, B Knoth, AW Kirkpatrick, ...
IEEE Transactions on Medical Robotics and Bionics 5 (1), 66-78, 2023
Cited by: 19; Year: 2023
Reinforcement learning for joint optimization of multiple rewards
M Agarwal, V Aggarwal
arXiv preprint arXiv:1909.02940, 2019
Cited by: 19*; Year: 2019
SARTRES: A semi-autonomous robot teleoperation environment for surgery
MM Rahman, MV Balakuntala, G Gonzalez, M Agarwal, U Kaur, ...
Computer Methods in Biomechanics and Biomedical Engineering: Imaging …, 2021
Cited by: 15; Year: 2021
Communication efficient parallel reinforcement learning
M Agarwal, B Ganguly, V Aggarwal
Uncertainty in Artificial Intelligence, 247-256, 2021
Cited by: 14; Year: 2021
Concave utility reinforcement learning with zero-constraint violations
M Agarwal, Q Bai, V Aggarwal
arXiv preprint arXiv:2109.05439, 2021
Cited by: 14; Year: 2021
Reinforcement learning for mean-field game
M Agarwal, V Aggarwal, A Ghosh, N Tiwari
Algorithms 15 (3), 73, 2022
Cited by: 13; Year: 2022
DART: Adaptive accept reject algorithm for non-linear combinatorial bandits
M Agarwal, V Aggarwal, AK Umrawal, C Quinn
Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 6557-6565, 2021
Cited by: 13*; Year: 2021
Reinforcement learning for joint optimization of multiple rewards
M Agarwal, V Aggarwal
Journal of Machine Learning Research 24 (49), 1-41, 2023
Cited by: 10; Year: 2023
Learning-based online QoE optimization in multi-agent video streaming
Y Wang, M Agarwal, T Lan, V Aggarwal
Algorithms 15 (7), 227, 2022
Cited by: 10; Year: 2022
Joint optimization of multi-objective reinforcement learning with policy gradient based algorithm
Q Bai, M Agarwal, V Aggarwal
arXiv preprint arXiv:2105.14125, 2021
Cited by: 10; Year: 2021
Articles 1–20