Reinforcement learning applications in environmental sustainability: a review

M Zuccotto, A Castellini, DL Torre, L Mola… - Artificial Intelligence …, 2024 - Springer
Environmental sustainability is a worldwide key challenge attracting increasing attention due
to climate change, pollution, and biodiversity decline. Reinforcement learning, initially …

Learning logic specifications for policy guidance in pomdps: an inductive logic programming approach

D Meli, A Castellini, A Farinelli - Journal of Artificial Intelligence Research, 2024 - jair.org
Abstract Partially Observable Markov Decision Processes (POMDPs) are a powerful
framework for planning under uncertainty. They allow to model state uncertainty as a belief …

Online inductive learning from answer sets for efficient reinforcement learning exploration

C Veronese, D Meli, A Farinelli - arxiv preprint arxiv:2501.07445, 2025 - arxiv.org
This paper presents a novel approach combining inductive logic programming with
reinforcement learning to improve training performance and explainability. We exploit …

Scalable safe policy improvement for factored multi-agent MDPs

F Bianchi, E Zorzi, A Castellini, T Simao… - PROCEEDINGS OF …, 2024 - iris.univr.it
In this work, we focus on safe policy improvement in multi-agent domains where current
state-of-the-art methods cannot be effectively applied because of large state and action …

Unsupervised active visual search with monte carlo planning under uncertain detections

F Taioli, F Giuliari, Y Wang, R Berra… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
We propose a solution for Active Visual Search of objects in an environment, whose 2D floor
map is the only known information. Our solution has three key features that make it more …

Safe POMDP online planning via shielding

S Sheng, D Parker, L Feng - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Partially observable Markov decision processes (POMDPs) have been widely used in many
robotic applications for sequential decision-making under uncertainty. POMDP online …

Online model adaptation in Monte Carlo tree search planning

M Zuccotto, E Fusa, A Castellini, A Farinelli - Optimization and …, 2024 - Springer
We propose a model-based reinforcement learning method using Monte Carlo Tree Search
planning. The approach assumes a black-box approximated model of the environment …

Inductive Learning of Robot Task Knowledge from Raw Data and Online Expert Feedback

D Meli, P Fiorini - arxiv preprint arxiv:2501.07507, 2025 - arxiv.org
The increasing level of autonomy of robots poses challenges of trust and social acceptance,
especially in human-robot interaction scenarios. This requires an interpretable …

Monte Carlo planning for mobile robots in large action spaces with velocity obstacles

L Bonanni, D Meli, A Castellini, A Farinelli - CEUR WORKSHOP …, 2024 - iris.univr.it
Motion planning in dynamic environments is a challenging robotic task, requiring collision
avoidance and real-time computation. State-of-the-art online methods as Velocity Obstacles …

Monte Carlo Tree Search with Velocity Obstacles for safe and efficient motion planning in dynamic environments

L Bonanni, D Meli, A Castellini, A Farinelli - arxiv preprint arxiv …, 2025 - arxiv.org
Online motion planning is a challenging problem for intelligent robots moving in dense
environments with dynamic obstacles, eg, crowds. In this work, we propose a novel …