Reinforcement learning for predictive maintenance: A systematic technical review

R Siraskar, S Kumar, S Patil, A Bongale… - Artificial Intelligence …, 2023 - Springer
The manufacturing world is subject to ever-increasing cost optimization pressures.
Maintenance adds to cost and disrupts production; optimized maintenance is therefore of …

Reinforcement learning algorithms with function approximation: Recent advances and applications

X Xu, L Zuo, Z Huang - Information sciences, 2014 - Elsevier
In recent years, the research on reinforcement learning (RL) has focused on function
approximation in learning prediction and control of Markov decision processes (MDPs). The …

[BOK][B] Reinforcement learning: An introduction

RS Sutton, AG Barto - 1998 - cambridge.org
This book introduces a new approach to the study of systems, living or artificial, that can
learn from experience, and in so doing it indicates a new focus of activity within Artificial …

[PDF][PDF] Policy evaluation with temporal differences: A survey and comparison

C Dann, G Neumann, J Peters - The Journal of Machine Learning …, 2014 - jmlr.org
Policy evaluation is an essential step in most reinforcement learning approaches. It yields a
value function, the quality assessment of states for a given policy, which can be used in a …

A unified view on multi-class support vector classification

Ü Doǧan, T Glasmachers, C Igel - The Journal of Machine Learning …, 2016 - dl.acm.org
A unified view on multi-class support vector machines (SVMs) is presented, covering most
prominent variants including the one-vs-all approach and the algorithms proposed by …

Meta-AF: Meta-learning for adaptive filters

J Casebeer, NJ Bryan… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
Adaptive filtering algorithms are pervasive throughout signal processing and have had a
material impact on a wide variety of domains including audio processing …

Adaptive network traffic control with an integrated model-based and data-driven approach and a decentralised solution method

ZC Su, AHF Chow, RX Zhong - Transportation Research Part C: Emerging …, 2021 - Elsevier
This paper presents an adaptive traffic controller for stochastic road networks with an
integrated model-based and data-driven solution framework. The model-based optimisation …

Adapting behavior via intrinsic reward: A survey and empirical study

C Linke, NM Ady, M White, T Degris, A White - Journal of artificial intelligence …, 2020 - jair.org
Learning about many things can provide numerous benefits to a reinforcement learning
system. For example, learning many auxiliary value functions, in addition to optimizing the …

The Alberta plan for AI research

RS Sutton, M Bowling, PM Pilarski - arxiv preprint arxiv:2208.11173, 2022 - arxiv.org
arxiv:2208.11173v3 [cs.AI] 21 Mar 2023 Page 1 1 The Alberta Plan for AI Research Richard S.
Sutton, Michael Bowling, and Patrick M. Pilarski University of Alberta Alberta Machine …

Searching for optimal per-coordinate step-sizes with multidimensional backtracking

F Kunstner, V Sanches Portella… - Advances in Neural …, 2023 - proceedings.neurips.cc
The backtracking line-search is an effective technique to automatically tune the step-size in
smooth optimization. It guarantees similar performance to using the theoretically optimal …