Distributional pareto-optimal multi-objective reinforcement learning
Multi-objective reinforcement learning (MORL) has been proposed to learn control policies
over multiple competing objectives with each possible preference over returns. However …
over multiple competing objectives with each possible preference over returns. However …
Actor-critic multi-objective reinforcement learning for non-linear utility functions
We propose a novel multi-objective reinforcement learning algorithm that successfully learns
the optimal policy even for non-linear utility functions. Non-linear utility functions pose a …
the optimal policy even for non-linear utility functions. Non-linear utility functions pose a …
Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning
In many risk-aware and multi-objective reinforcement learning settings, the utility of the user
is derived from a single execution of a policy. In these settings, making decisions based on …
is derived from a single execution of a policy. In these settings, making decisions based on …
Multi-objective intelligent clustering routing schema for internet of things enabled wireless sensor networks using deep reinforcement learning
Abstract The Internet of Things (IoT IoT) is built on a foundation of wireless sensor devices
that connect humans and physical objects to the Internet and enable them to interact with …
that connect humans and physical objects to the Internet and enable them to interact with …
Distributional multi-objective decision making
For effective decision support in scenarios with conflicting objectives, sets of potentially
optimal solutions can be presented to the decision maker. We explore both what policies …
optimal solutions can be presented to the decision maker. We explore both what policies …
[PDF][PDF] Decision-theoretic planning for the expected scalarised returns
In sequential multi-objective decision making (MODeM) settings, when the utility of a user is
derived from a single execution of a policy, policies for the expected scalarised returns …
derived from a single execution of a policy, policies for the expected scalarised returns …
Multi-objective coordination graphs for the expected scalarised returns with generative flow models
Many real-world problems contain multiple objectives and agents, where a trade-off exists
between objectives. Key to solving such problems is to exploit sparse dependency …
between objectives. Key to solving such problems is to exploit sparse dependency …
From fair solutions to compromise solutions in multi-objective deep reinforcement learning
J Qian, U Siddique, G Yu, P Weng - Neural Computing and Applications, 2025 - Springer
In this paper, we focus on multi-objective reinforcement learning (RL) where the expected
vector returns are aggregated with a concave function. For this generic framework, which …
vector returns are aggregated with a concave function. For this generic framework, which …
[PDF][PDF] Multi-objective distributional value iteration
In sequential multi-objective decision making (MODeM) settings, when the utility of a user is
derived from a single execution of a policy, policies for the expected scalarised returns …
derived from a single execution of a policy, policies for the expected scalarised returns …
[PDF][PDF] Multi-objective decision making for trustworthy ai
If widespread deployment of AI systems is to be accepted by society in the future, it is crucial
that such systems are trustworthy. Trustworthiness for autonomous systems has a number of …
that such systems are trustworthy. Trustworthiness for autonomous systems has a number of …