Reinforcement learning for predictive maintenance: A systematic technical review

R Siraskar, S Kumar, S Patil, A Bongale… - Artificial Intelligence …, 2023 - Springer
The manufacturing world is subject to ever-increasing cost optimization pressures.
Maintenance adds to cost and disrupts production; optimized maintenance is therefore of …

A review of reward functions for reinforcement learning in the context of autonomous driving

A Abouelazm, J Michel… - 2024 IEEE Intelligent …, 2024 - ieeexplore.ieee.org
Reinforcement learning has emerged as an important approach for autonomous driving. A
reward function is used in reinforcement learning to establish the learned skill objectives …

Guarantees for epsilon-greedy reinforcement learning with function approximation

C Dann, Y Mansour, M Mohri… - International …, 2022 - proceedings.mlr.press
Myopic exploration policies such as epsilon-greedy, softmax, or Gaussian noise fail to
explore efficiently in some reinforcement learning tasks and yet, they perform well in many …

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

Operational optimization for off-grid renewable building energy system using deep reinforcement learning

Y Gao, Y Matsunami, S Miyata, Y Akashi - Applied Energy, 2022 - Elsevier
With the application of renewable energy in single office buildings, an increasing number of
power grids require building systems coupled with renewable energy to realize off-grid …

Learning reward machines for partially observable reinforcement learning

R Toro Icarte, E Waldie, T Klassen… - Advances in neural …, 2019 - proceedings.neurips.cc
Abstract Reward Machines (RMs), originally proposed for specifying problems in
Reinforcement Learning (RL), provide a structured, automata-based representation of a …

Multi-modal policy fusion for end-to-end autonomous driving

Z Huang, S Sun, J Zhao, L Mao - Information Fusion, 2023 - Elsevier
Multi-modal learning has made impressive progress in autonomous driving by leveraging
information from multiple sensors. Existing feature fusion methods make decisions by …

Grounding complex natural language commands for temporal tasks in unseen environments

JX Liu, Z Yang, I Idrees, S Liang… - … on Robot Learning, 2023 - proceedings.mlr.press
Grounding navigational commands to linear temporal logic (LTL) leverages its unambiguous
semantics for reasoning about long-horizon tasks and verifying the satisfaction of temporal …

Structure in deep reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024 - jair.org
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press
We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …