Evolutionary reinforcement learning: A survey

H Bai, R Cheng, Y ** - Intelligent Computing, 2023 - spj.science.org
Reinforcement learning (RL) is a machine learning approach that trains agents to maximize
cumulative rewards through interactions with environments. The integration of RL with deep …

Personalized soups: Personalized large language model alignment via post-hoc parameter merging

J Jang, S Kim, BY Lin, Y Wang, J Hessel… - arxiv preprint arxiv …, 2023 - arxiv.org
While Reinforcement Learning from Human Feedback (RLHF) aligns Large Language
Models (LLMs) with general, aggregate human preferences, it is suboptimal for learning …

A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer
Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

A toolkit for reliable benchmarking and research in multi-objective reinforcement learning

F Felten, LN Alegre, A Nowe… - Advances in …, 2023 - proceedings.neurips.cc
Multi-objective reinforcement learning algorithms (MORL) extend standard reinforcement
learning (RL) to scenarios where agents must optimize multiple---potentially conflicting …

Evolutionary multi-objective reinforcement learning based trajectory control and task offloading in UAV-assisted mobile edge computing

F Song, H **ng, X Wang, S Luo, P Dai… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
This article studies the trajectory control and task offloading (TCTO) problem in an
unmanned aerial vehicle (UAV)-assisted mobile edge computing system, where a UAV flies …

MO-MIX: Multi-objective multi-agent cooperative decision-making with deep reinforcement learning

T Hu, B Luo, C Yang, T Huang - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org
Deep reinforcement learning (RL) has been applied extensively to solve complex decision-
making problems. In many real-world scenarios, tasks often have several conflicting …

Hypervolume maximization: A geometric view of pareto set learning

X Zhang, X Lin, B Xue, Y Chen… - Advances in Neural …, 2023 - proceedings.neurips.cc
This paper presents a novel approach to multiobjective algorithms aimed at modeling the
Pareto set using neural networks. Whereas previous methods mainly focused on identifying …

Discovering high entropy alloy electrocatalysts in vast composition spaces with multiobjective optimization

W Xu, E Diesen, T He, K Reuter… - Journal of the American …, 2024 - ACS Publications
High entropy alloys (HEAs) are a highly promising class of materials for electrocatalysis as
their unique active site distributions break the scaling relations that limit the activity of …

Reducing idleness in financial cloud services via multi-objective evolutionary reinforcement learning based load balancer

P Yang, L Zhang, H Liu, G Li - Science China Information Sciences, 2024 - Springer
In recent years, various companies have started to shift their data services from traditional
data centers to the cloud. One of the major motivations is to save on operational costs with …

Collaborative ground-space communications via evolutionary multi-objective deep reinforcement learning

J Li, G Sun, Q Wu, D Niyato, J Kang… - IEEE Journal on …, 2024 - ieeexplore.ieee.org
Low Earth Orbit (LEO) satellites have emerged as crucial enablers of direct connections with
remote terrestrial terminals. However, energy limitations and insufficient antenna capabilities …