Segueix
Shambhuraj Sawant
Shambhuraj Sawant
PhD Candidate at NTNU Trondheim
Correu electrònic verificat a ntnu.no - Pàgina d'inici
Títol
Citada per
Citada per
Any
Sample-efficient cross-entropy method for real-time planning
C Pinneri, S Sawant, S Blaes, J Achterhold, J Stueckler, M Rolinek, ...
Conference on Robot Learning, 2020
1192020
Extracting strong policies for robotics tasks from zero-order trajectory optimizers
C Pinneri*, S Sawant*, S Blaes, G Martius
International Conference on Learning Representations, 2020
132020
A learning-based model predictive control strategy for home energy management systems
W Cai, S Sawant, D Reinhardt, S Rastegarpour, S Gros
IEEE Access 11, 145264-145280, 2023
102023
Learning-based MPC from big data using reinforcement learning
S Sawant, AS Anand, D Reinhardt, S Gros
arXiv preprint arXiv:2301.01667, 2023
102023
Data-driven predictive control and MPC: Do we achieve optimality?
AS Anand, S Sawant, D Reinhardt, S Gros
IFAC-PapersOnLine 58 (15), 73-78, 2024
42024
A painless deterministic policy gradient method for learning-based MPC
AS Anand, D Reinhardt, S Sawant, JT Gravdahl, S Gros
2023 European Control Conference (ECC), 1-7, 2023
42023
Bridging the gap between QP-based and MPC-based Reinforcement Learning
S Sawant, S Gros
IFAC-PapersOnLine 55 (15), 7-12, 2022
32022
Economic Model Predictive Control as a Solution to Markov Decision Processes
D Reinhardt, AS Anand, S Sawant, S Gros
arXiv preprint arXiv:2407.16500, 2024
22024
Model-Free Data-Driven Predictive Control Using Reinforcement Learning
S Sawant, D Reinhardt, AB Kordabad, S Gros
2023 62nd IEEE Conference on Decision and Control (CDC), 4046-4052, 2023
22023
All AI Models are Wrong, but Some are Optimal
AS Anand, S Sawant, D Reinhardt, S Gros
arXiv preprint arXiv:2501.06086, 2025
12025
Hierarchical Reinforcement Learning for Spatio-temporal Planning
SV Sawant
2018
En aquests moments el sistema no pot dur a terme l'operació. Torneu-ho a provar més tard.
Articles 1–11