A review of deep reinforcement learning for smart building energy management

L Yu, S Qin, M Zhang, C Shen, T Jiang… - IEEE Internet of Things …, 2021 - ieeexplore.ieee.org
Global buildings account for about 30% of the total energy consumption and carbon
emission, raising severe energy and environmental concerns. Therefore, it is significant and …

Deep reinforcement learning for multiagent systems: A review of challenges, solutions, and applications

TT Nguyen, ND Nguyen… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Reinforcement learning (RL) algorithms have been around for decades and employed to
solve various sequential decision-making problems. These algorithms, however, have faced …

Rewarded soups: towards pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards

A Rame, G Couairon, C Dancette… - Advances in …, 2024 - proceedings.neurips.cc
Foundation models are first pre-trained on vast unsupervised datasets and then fine-tuned
on labeled data. Reinforcement learning, notably from human feedback (RLHF), can further …

A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer
Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

Deep reinforcement learning for cyber security

TT Nguyen, VJ Reddi - IEEE Transactions on Neural Networks …, 2021 - ieeexplore.ieee.org
The scale of Internet-connected systems has increased considerably, and these systems are
being exposed to cyberattacks more than ever. The complexity and dynamics of …

Levels of explainable artificial intelligence for human-aligned conversational explanations

R Dazeley, P Vamplew, C Foale, C Young, S Aryal… - Artificial Intelligence, 2021 - Elsevier
Over the last few years there has been rapid research growth into eXplainable Artificial
Intelligence (XAI) and the closely aligned Interpretable Machine Learning (IML). Drivers for …

Multi-objective multi-agent decision making: a utility-based analysis and survey

R Rădulescu, P Mannion, DM Roijers… - Autonomous Agents and …, 2020 - Springer
The majority of multi-agent system implementations aim to optimise agents' policies with
respect to a single objective, despite the fact that many real-world problem domains are …

Intelligent fault-tolerance data routing scheme for IoT-enabled WSNs

V Agarwal, S Tapaswi, P Chanak - IEEE Internet of Things …, 2022 - ieeexplore.ieee.org
Wireless sensor networks (WSNs) have become one of the essential components of the
Internet of Things (IoT). In any IoT application, different sensor-based devices gather data …

Exploration in deep reinforcement learning: From single-agent to multiagent domain

J Hao, T Yang, H Tang, C Bai, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL)
have achieved significant success across a wide range of domains, including game artificial …

Offloading dependent tasks in multi-access edge computing: A multi-objective reinforcement learning approach

F Song, H **ng, X Wang, S Luo, P Dai, K Li - Future Generation Computer …, 2022 - Elsevier
This paper studies the problem of offloading an application consisting of dependent tasks in
multi-access edge computing (MEC). This problem is challenging because multiple …