Obserwuj
Mirco Mutti
Tytuł
Cytowane przez
Cytowane przez
Rok
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
M Mutti, L Pratissoli, M Restelli
AAAI 2021, 2021
71*2021
Configurable Markov Decision Processes
AM Metelli, M Mutti, M Restelli
ICML 2018, 2018
502018
The Importance of Non-Markovianity in Maximum State Entropy Exploration
M Mutti, R De Santi, M Restelli
ICML 2022, 2022
312022
Unsupervised Reinforcement Learning in Multiple Environments
M Mutti, M Mancassola, M Restelli
AAAI 2022, 2022
282022
An Intrinsically-Motivated Approach for Learning Highly Exploring and Fast Mixing Policies
M Mutti, M Restelli
AAAI 2020, 2019
272019
Challenging Common Assumptions in Convex Reinforcement Learning
M Mutti, R De Santi, P De Bartolomeis, M Restelli
NeurIPS 2022, 2022
242022
Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
M Mutti, R De Santi, E Rossi, JF Calderon, M Bronstein, M Restelli
AAAI 2023, 2022
19*2022
Convex Reinforcement Learning in Finite Trials
M Mutti, R De Santi, P De Bartolomeis, M Restelli
JMLR 24 (250), 1-42, 2023
142023
Persuading Farsighted Receivers in MDPs: the Power of Honesty
M Bernasconi, M Castiglioni, A Marchesi, M Mutti
NeurIPS 2023, 2023
62023
Reward-Free Policy Space Compression for Reinforcement Learning
M Mutti, S Del Col, M Restelli
AISTATS 2022, 2022
52022
A Framework for Partially Observed Reward-States in RLHF
C Kausik, M Mutti, A Pacchiano, A Tewari
arXiv preprint arXiv:2402.03282, 2024
42024
Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms
F Lazzati, M Mutti, AM Metelli
ICML 2024, 2024
32024
Unsupervised Reinforcement Learning via State Entropy Maximization
M Mutti
PhD Thesis, Università di Bologna, 2023
32023
Test-Time Regret Minimization in Meta Reinforcement Learning
M Mutti, A Tamar
ICML 2024, 2024
22024
A Tale of Sampling and Estimation in Discounted Reinforcement Learning
AM Metelli, M Mutti, M Restelli
AISTATS 2023, 2023
22023
How to Explore with Belief: State Entropy Maximization in POMDPs
R Zamboni, D Cirino, M Restelli, M Mutti
ICML 2024, 2024
12024
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
M Mutti, R De Santi, M Restelli, A Marx, G Ramponi
ICLR 2024, 2024
12024
Non-Markovian Policies for Unsupervised Reinforcement Learning in Multiple Environments
P Maldini, M Mutti, R De Santi, M Restelli
First Workshop on Pre-training: Perspectives, Pitfalls, and Paths Forward at …, 2022
12022
Reward Compatibility: A Framework for Inverse RL
F Lazzati, M Mutti, A Metelli
arXiv preprint arXiv:2501.07996, 2025
2025
The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
R Zamboni, D Cirino, M Restelli, M Mutti
arXiv preprint arXiv:2406.12795, 2024
2024
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20