Distributional pareto-optimal multi-objective reinforcement learning

XQ Cai, P Zhang, L Zhao, J Bian… - Advances in …, 2023 - proceedings.neurips.cc
Multi-objective reinforcement learning (MORL) has been proposed to learn control policies
over multiple competing objectives with each possible preference over returns. However …

Actor-critic multi-objective reinforcement learning for non-linear utility functions

M Reymond, CF Hayes, D Steckelmacher… - Autonomous Agents and …, 2023 - Springer
We propose a novel multi-objective reinforcement learning algorithm that successfully learns
the optimal policy even for non-linear utility functions. Non-linear utility functions pose a …

Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning

CF Hayes, M Reymond, DM Roijers, E Howley… - Autonomous Agents and …, 2023 - Springer
In many risk-aware and multi-objective reinforcement learning settings, the utility of the user
is derived from a single execution of a policy. In these settings, making decisions based on …

Multi-objective intelligent clustering routing schema for internet of things enabled wireless sensor networks using deep reinforcement learning

WK Ghamry, S Shukry - Cluster Computing, 2024 - Springer
Abstract The Internet of Things (IoT IoT) is built on a foundation of wireless sensor devices
that connect humans and physical objects to the Internet and enable them to interact with …

Distributional multi-objective decision making

W Röpke, CF Hayes, P Mannion, E Howley… - arxiv preprint arxiv …, 2023 - arxiv.org
For effective decision support in scenarios with conflicting objectives, sets of potentially
optimal solutions can be presented to the decision maker. We explore both what policies …

[PDF][PDF] Decision-theoretic planning for the expected scalarised returns

CF Hayes, DM Roijers, E Howley… - Proceedings of the 21st …, 2022 - ifmas.csc.liv.ac.uk
In sequential multi-objective decision making (MODeM) settings, when the utility of a user is
derived from a single execution of a policy, policies for the expected scalarised returns …

Multi-objective coordination graphs for the expected scalarised returns with generative flow models

CF Hayes, T Verstraeten, DM Roijers, E Howley… - arxiv preprint arxiv …, 2022 - arxiv.org
Many real-world problems contain multiple objectives and agents, where a trade-off exists
between objectives. Key to solving such problems is to exploit sparse dependency …

From fair solutions to compromise solutions in multi-objective deep reinforcement learning

J Qian, U Siddique, G Yu, P Weng - Neural Computing and Applications, 2025 - Springer
In this paper, we focus on multi-objective reinforcement learning (RL) where the expected
vector returns are aggregated with a concave function. For this generic framework, which …

[PDF][PDF] Multi-objective distributional value iteration

CF Hayes, DM Roijers, E Howley… - Adaptive and Learning …, 2022 - researchgate.net
In sequential multi-objective decision making (MODeM) settings, when the utility of a user is
derived from a single execution of a policy, policies for the expected scalarised returns …

[PDF][PDF] Multi-objective decision making for trustworthy ai

P Mannion, F Heintz… - … of the Multi …, 2021 - modem2021.cs.universityofgalway …
If widespread deployment of AI systems is to be accepted by society in the future, it is crucial
that such systems are trustworthy. Trustworthiness for autonomous systems has a number of …