A practical guide to multi-objective reinforcement learning and planning
Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …
between multiple, often conflicting, objectives. Despite this, the majority of research in …
FALCON-FArm Level CONtrol for wind turbines using multi-agent deep reinforcement learning
Turbines in a wind farm dynamically influence each other through wakes. Therefore trade-
offs exist between energy output of upstream turbines and the health of downstream …
offs exist between energy output of upstream turbines and the health of downstream …
Finite-time frequentist regret bounds of multi-agent thompson sampling on sparse hypergraphs
We study the multi-agent multi-armed bandit (MAMAB) problem, where agents are factored
into overlap** groups. Each group represents a hyperedge, forming a hypergraph over …
into overlap** groups. Each group represents a hyperedge, forming a hypergraph over …
[PDF][PDF] Deep reinforcement learning for active wake control
G Neustroev, SPE Andringa, RA Verzijlbergh… - Proceedings of the 21st …, 2022 - ifaamas.org
Wind farms suffer from so-called wake effects: when turbines are located in the wind
shadows of other turbines, their power output is substantially reduced. These losses can be …
shadows of other turbines, their power output is substantially reduced. These losses can be …
Deep reinforcement learning for the direct optimization of gradient separations in liquid chromatography
Abstract While Reinforcement Learning (RL) has already proven successful in performing
complex tasks, such as controlling large-scale epidemics, mitigating influenza and playing …
complex tasks, such as controlling large-scale epidemics, mitigating influenza and playing …
Multi-objective coordination graphs for the expected scalarised returns with generative flow models
Many real-world problems contain multiple objectives and agents, where a trade-off exists
between objectives. Key to solving such problems is to exploit sparse dependency …
between objectives. Key to solving such problems is to exploit sparse dependency …
Hyperparameter tuning framework for calibrating analytical wake models using SCADA data of an offshore wind farm applied on FLORIS
D van Binsbergen, PJ Daems… - Wind Energy …, 2023 - wes.copernicus.org
Calibrating analytical wake models for wind farm yield assessment and wind farm flow
control presents significant challenges. This study provides a robust methodology for the …
control presents significant challenges. This study provides a robust methodology for the …
[HTML][HTML] Hyperparameter tuning framework for calibrating analytical wake models using SCADA data of an offshore wind farm
D van Binsbergen, PJ Daems… - Wind Energy …, 2024 - wes.copernicus.org
This work presents a robust methodology for calibrating analytical wake models, as
demonstrated on the velocity deficit parameters of the Gauss–curl hybrid model using 4 …
demonstrated on the velocity deficit parameters of the Gauss–curl hybrid model using 4 …
Wind Power Prediction using Multi-Task Gaussian Process Regression with Lagged Inputs
Wind is a renewable energy source that has become more important in recent years. Wind
turbines are equipped with a SCADA system, which allows for remote supervision of the …
turbines are equipped with a SCADA system, which allows for remote supervision of the …
Consensus-Based Thompson Sampling for Stochastic Multiarmed Bandits
N Hayashi - IEEE Transactions on Automatic Control, 2024 - ieeexplore.ieee.org
This article considers a distributed Thompson sampling algorithm for a cooperative
multiplayer multiarmed bandit problem. We consider a multiagent system in which each …
multiplayer multiarmed bandit problem. We consider a multiagent system in which each …