Best Arm Identification for Stochastic Rising Bandits M Mussi, A Montenegro, F Trovò, M Restelli, AM Metelli arXiv preprint arXiv:2302.07510, 2023 | 6 | 2023 |
Last-iterate global convergence of policy gradients for constrained reinforcement learning A Montenegro, M Mussi, M Papini, AM Metelli arXiv preprint arXiv:2407.10775, 2024 | 1 | 2024 |
Learning Optimal Deterministic Policies with Stochastic Policy Gradients A Montenegro, M Mussi, AM Metelli, M Papini arXiv preprint arXiv:2405.02235, 2024 | 1 | 2024 |
Best model selection via stochastic rising bandits A Montenegro | | 2022 |
Stochastic Rising Bandits: A Best Arm Identification Approach A Montenegro, M Mussi, F Trovò, M Restelli, AM Metelli Sixteenth European Workshop on Reinforcement Learning | | |
A Best Arm Identification Approach for Stochastic Rising Bandits A Montenegro, M Mussi, F Trovò, M Restelli, AM Metelli ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems | | |