Safe multi-agent reinforcement learning via shielding I ElSayed-Aly, S Bharadwaj, C Amato, R Ehlers, U Topcu, L Feng arXiv preprint arXiv:2101.11196, 2021 | 103 | 2021 |
Temporal-logic-based reward shaping for continuing reinforcement learning tasks Y Jiang, S Bharadwaj, B Wu, R Shah, U Topcu, P Stone Proceedings of the AAAI Conference on artificial Intelligence 35 (9), 7995-8003, 2021 | 62 | 2021 |
Decentralized Control Synthesis for Air Traffic Management in Urban Air Mobility S Bharadwaj, SP Carr, NA Neogi, U Topcu IEEE Transactions on Control of Network Systems, 2021 | 42 | 2021 |
Synthesis of surveillance strategies via belief abstraction S Bharadwaj, R Dimitrova, U Topcu 2018 IEEE Conference on Decision and Control (CDC), 4159-4166, 2018 | 31 | 2018 |
Synthesis of minimum-cost shields for multi-agent systems S Bharadwaj, R Bloem, R Dimitrova, B Konighofer, U Topcu 2019 American Control Conference (ACC), 1048-1055, 2019 | 25 | 2019 |
Traffic management for urban air mobility S Bharadwaj, S Carr, N Neogi, H Poonawala, AB Chueca, U Topcu NASA Formal Methods: 11th International Symposium, NFM 2019, Houston, TX …, 2019 | 18 | 2019 |
The diver with a rotor S Bharadwaj, N Duignan, HR Dullin, K Leung, W Tong Indagationes Mathematicae 27 (5), 1147-1161, 2016 | 11 | 2016 |
Byzantine-resilient distributed hypothesis testing with time-varying network topology B Wu, S Carr, S Bharadwaj, Z Xu, U Topcu IEEE Transactions on Automatic Control 67 (7), 3243-3258, 2021 | 10 | 2021 |
Online synthesis for runtime enforcement of safety in multiagent systems D Raju, S Bharadwaj, F Djeumou, U Topcu IEEE Transactions on Control of Network Systems 8 (2), 621-632, 2021 | 10 | 2021 |
Reduction techniques for model checking and learning in MDPs S Bharadwaj, S Le Roux, G Pérez, U Topcu Proceedings of the 26st International Joint Conference on Artificial …, 2017 | 10 | 2017 |
Minimum-violation traffic management for urban air mobility S Bharadwaj, T Wongpiromsarn, N Neogi, J Muffoletto, U Topcu NASA Formal Methods Symposium, 37-52, 2021 | 9 | 2021 |
Safe policies for factored partially observable stochastic games S Carr, N Jansen, S Bharadwaj, M Spaan, U Topcu | 8 | 2021 |
Resilient distributed hypothesis testing with time-varying network topology B Wu, S Carr, S Bharadwaj, Z Xu, U Topcu 2020 American Control Conference (ACC), 1483-1488, 2020 | 7 | 2020 |
Reward-based deception with cognitive bias B Wu, M Cubuktepe, S Bharadwaj, U Topcu 2019 IEEE 58th Conference on Decision and Control (CDC), 2265-2270, 2019 | 7 | 2019 |
Cost-bounded active classification using partially observable Markov decision processes B Wu, M Ahmadi, S Bharadwaj, U Topcu 2019 American Control Conference (ACC), 1216-1223, 2019 | 7 | 2019 |
Stochastic games with sensing costs M Ahmadi, S Bharadwaj, T Tanaka, U Topcu 2018 56th Annual Allerton Conference on Communication, Control, and …, 2018 | 6 | 2018 |
Synthesis of minimum-cost shields for distributed systems S Bharadwaj, R Bloem, R Dimitrova, B Könighofer, U Topcu 2019 Annual American Control Conference, ACC, 10-12, 2019 | 5 | 2019 |
Distributed synthesis of surveillance strategies for mobile sensors S Bharadwaj, R Dimitrova, U Topcu 2018 IEEE Conference on Decision and Control (CDC), 3335-3342, 2018 | 5 | 2018 |
Transfer entropy in MDPs with temporal logic specifications S Bharadwaj, M Ahmadi, T Tanaka, U Topcu 2018 IEEE Conference on Decision and Control (CDC), 4173-4180, 2018 | 5 | 2018 |
Synthesis of strategies for autonomous surveillance on adversarial targets S Bharadwaj, R Dimitrova, J Quattrociocchi, U Topcu Robotics and Autonomous Systems 153, 104084, 2022 | 4 | 2022 |