- Academic Search

Tuning-free step-size adaptation

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

Reinforcement learning for predictive maintenance: A systematic technical review

R Siraskar, S Kumar, S Patil, A Bongale… - Artificial Intelligence …, 2023 - Springer

The manufacturing world is subject to ever-increasing cost optimization pressures.
Maintenance adds to cost and disrupts production; optimized maintenance is therefore of …

Lagre Referanse Sitert av 38 Beslektede artikler Alle 3 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] romisatriawahono.net

Reinforcement learning algorithms with function approximation: Recent advances and applications

X Xu, L Zuo, Z Huang - Information sciences, 2014 - Elsevier

In recent years, the research on reinforcement learning (RL) has focused on function
approximation in learning prediction and control of Markov decision processes (MDPs). The …

Lagre Referanse Sitert av 231 Beslektede artikler Alle 5 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

[BOK][B] Reinforcement learning: An introduction

RS Sutton, AG Barto - 1998 - cambridge.org

This book introduces a new approach to the study of systems, living or artificial, that can
learn from experience, and in so doing it indicates a new focus of activity within Artificial …

Lagre Referanse Sitert av 79582 Beslektede artikler Alle 50 versjoner Søk i biblioteket

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

[PDF][PDF] Policy evaluation with temporal differences: A survey and comparison

C Dann, G Neumann, J Peters - The Journal of Machine Learning …, 2014 - jmlr.org

Policy evaluation is an essential step in most reinforcement learning approaches. It yields a
value function, the quality assessment of states for a given policy, which can be used in a …

Lagre Referanse Sitert av 306 Beslektede artikler Alle 20 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

A unified view on multi-class support vector classification

Ü Doǧan, T Glasmachers, C Igel - The Journal of Machine Learning …, 2016 - dl.acm.org

A unified view on multi-class support vector machines (SVMs) is presented, covering most
prominent variants including the one-vs-all approach and the algorithms proposed by …

Lagre Referanse Sitert av 137 Beslektede artikler Alle 11 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Meta-AF: Meta-learning for adaptive filters

J Casebeer, NJ Bryan… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org

Adaptive filtering algorithms are pervasive throughout signal processing and have had a
material impact on a wide variety of domains including audio processing …

Lagre Referanse Sitert av 28 Beslektede artikler Alle 5 versjoner

Adaptive network traffic control with an integrated model-based and data-driven approach and a decentralised solution method

ZC Su, AHF Chow, RX Zhong - Transportation Research Part C: Emerging …, 2021 - Elsevier

This paper presents an adaptive traffic controller for stochastic road networks with an
integrated model-based and data-driven solution framework. The model-based optimisation …

Lagre Referanse Sitert av 39 Beslektede artikler Alle 5 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] jair.org

Adapting behavior via intrinsic reward: A survey and empirical study

C Linke, NM Ady, M White, T Degris, A White - Journal of artificial intelligence …, 2020 - jair.org

Learning about many things can provide numerous benefits to a reinforcement learning
system. For example, learning many auxiliary value functions, in addition to optimizing the …

Lagre Referanse Sitert av 54 Beslektede artikler Alle 8 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

The Alberta plan for AI research

RS Sutton, M Bowling, PM Pilarski - arxiv preprint arxiv:2208.11173, 2022 - arxiv.org

arxiv:2208.11173v3 [cs.AI] 21 Mar 2023 Page 1 1 The Alberta Plan for AI Research Richard S.
Sutton, Michael Bowling, and Patrick M. Pilarski University of Alberta Alberta Machine …

Lagre Referanse Sitert av 20 Beslektede artikler Alle 10 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Searching for optimal per-coordinate step-sizes with multidimensional backtracking

F Kunstner, V Sanches Portella… - Advances in Neural …, 2023 - proceedings.neurips.cc

The backtracking line-search is an effective technique to automatically tune the step-size in
smooth optimization. It guarantees similar performance to using the theoretically optimal …

Lagre Referanse Sitert av 7 Beslektede artikler Alle 7 versjoner HTML-versjon

Opprett varsel

Referanse

Avansert søk

Lagret i Mitt bibliotek

Tuning-free step-size adaptation

Reinforcement learning for predictive maintenance: A systematic technical review

Reinforcement learning algorithms with function approximation: Recent advances and applications

[BOK][B] Reinforcement learning: An introduction

[PDF][PDF] Policy evaluation with temporal differences: A survey and comparison

A unified view on multi-class support vector classification

Meta-AF: Meta-learning for adaptive filters

Adaptive network traffic control with an integrated model-based and data-driven approach and a decentralised solution method

Adapting behavior via intrinsic reward: A survey and empirical study

The Alberta plan for AI research

Searching for optimal per-coordinate step-sizes with multidimensional backtracking