A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer
Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

FALCON-FArm Level CONtrol for wind turbines using multi-agent deep reinforcement learning

VR Padullaparthi, S Nagarathinam, A Vasan… - Renewable Energy, 2022 - Elsevier
Turbines in a wind farm dynamically influence each other through wakes. Therefore trade-
offs exist between energy output of upstream turbines and the health of downstream …

Finite-time frequentist regret bounds of multi-agent thompson sampling on sparse hypergraphs

T **, HL Hsu, W Chang, P Xu - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
We study the multi-agent multi-armed bandit (MAMAB) problem, where agents are factored
into overlap** groups. Each group represents a hyperedge, forming a hypergraph over …

[PDF][PDF] Deep reinforcement learning for active wake control

G Neustroev, SPE Andringa, RA Verzijlbergh… - Proceedings of the 21st …, 2022 - ifaamas.org
Wind farms suffer from so-called wake effects: when turbines are located in the wind
shadows of other turbines, their power output is substantially reduced. These losses can be …

Deep reinforcement learning for the direct optimization of gradient separations in liquid chromatography

A Kensert, P Libin, G Desmet, D Cabooter - Journal of Chromatography A, 2024 - Elsevier
Abstract While Reinforcement Learning (RL) has already proven successful in performing
complex tasks, such as controlling large-scale epidemics, mitigating influenza and playing …

Multi-objective coordination graphs for the expected scalarised returns with generative flow models

CF Hayes, T Verstraeten, DM Roijers, E Howley… - arxiv preprint arxiv …, 2022 - arxiv.org
Many real-world problems contain multiple objectives and agents, where a trade-off exists
between objectives. Key to solving such problems is to exploit sparse dependency …

Hyperparameter tuning framework for calibrating analytical wake models using SCADA data of an offshore wind farm applied on FLORIS

D van Binsbergen, PJ Daems… - Wind Energy …, 2023 - wes.copernicus.org
Calibrating analytical wake models for wind farm yield assessment and wind farm flow
control presents significant challenges. This study provides a robust methodology for the …

[HTML][HTML] Hyperparameter tuning framework for calibrating analytical wake models using SCADA data of an offshore wind farm

D van Binsbergen, PJ Daems… - Wind Energy …, 2024 - wes.copernicus.org
This work presents a robust methodology for calibrating analytical wake models, as
demonstrated on the velocity deficit parameters of the Gauss–curl hybrid model using 4 …

Wind Power Prediction using Multi-Task Gaussian Process Regression with Lagged Inputs

FJ Ávila, T Verstraeten, K Vratsinis… - Journal of Physics …, 2023 - iopscience.iop.org
Wind is a renewable energy source that has become more important in recent years. Wind
turbines are equipped with a SCADA system, which allows for remote supervision of the …

Consensus-Based Thompson Sampling for Stochastic Multiarmed Bandits

N Hayashi - IEEE Transactions on Automatic Control, 2024 - ieeexplore.ieee.org
This article considers a distributed Thompson sampling algorithm for a cooperative
multiplayer multiarmed bandit problem. We consider a multiagent system in which each …