Академия Google

A Gosavi - INFORMS Journal on Computing, 2009 - pubsonline.informs.org

In the last few years, reinforcement learning (RL), also called adaptive (or approximate)
dynamic programming, has emerged as a powerful tool for solving complex sequential …

Сохранить Цитировать Цитируется: 455 Похожие статьи Все версии статьи (15)

[КНИГА][B] Control systems and reinforcement learning

S Meyn - 2022 - books.google.com

A high school student can create deep Q-learning code to control her robot, without any
understanding of the meaning of'deep'or'Q', or why the code sometimes fails. This book is …

Сохранить Цитировать Цитируется: 158 Похожие статьи Все версии статьи (3) Поиск в библиотеках

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

[КНИГА][B] Markov chains: Basic definitions

R Douc, E Moulines, P Priouret, P Soulier, R Douc… - 2018 - Springer

Heuristically, a discrete-time stochastic process has the Markov property if the past and
future are independent given the present. In this introductory chapter, we give the formal …

Сохранить Цитировать Цитируется: 397 Похожие статьи Все версии статьи (7) Поиск в библиотеках

[Free GPT-4]
[DeepSeek]

[PDF] tandfonline.com

Operational Research: methods and applications

F Petropoulos, G Laporte, E Aktas… - Journal of the …, 2024 - Taylor & Francis

Abstract Throughout its history, Operational Research has evolved to include methods,
models and algorithms that have been applied to a wide range of contexts. This …

Сохранить Цитировать Цитируется: 43 Похожие статьи Все версии статьи (36)

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

[КНИГА][B] Markov chains and stochastic stability

SP Meyn, RL Tweedie - 2012 - books.google.com

Markov Chains and Stochastic Stability is part of the Communications and Control
Engineering Series (CCES) edited by Professors BW Dickinson, ED Sontag, M. Thoma, A …

Сохранить Цитировать Цитируется: 8701 Похожие статьи Все версии статьи (12) Поиск в библиотеках

[Free GPT-4]
[DeepSeek]

[PDF] ias.ac.in

The ODE method for convergence of stochastic approximation and reinforcement learning

VS Borkar, SP Meyn - SIAM Journal on Control and Optimization, 2000 - SIAM

It is shown here that stability of the stochastic approximation algorithm is implied by the
asymptotic stability of the origin for an associated ODE. This in turn implies convergence of …

Сохранить Цитировать Цитируется: 696 Похожие статьи Все версии статьи (13)

[Free GPT-4]
[DeepSeek]

[PDF] projecteuclid.org

On positive Harris recurrence of multiclass queueing networks: a unified approach via fluid limit models

JG Dai - The Annals of Applied Probability, 1995 - projecteuclid.org

It is now known that the usual traffic condition (the nominal load being less than 1 at each
station) is not sufficient for stability for a multiclass open queueing network. Although there …

Сохранить Цитировать Цитируется: 1111 Похожие статьи Все версии статьи (10)

[Free GPT-4]
[DeepSeek]

[PDF] ethernet.edu.et

Stochastic networked control systems

S Yüksel, T Basar - AMC, 2013 - Springer

Our goal in writing this book has been to provide a comprehensive, mathematically rigorous,
but still accessible treatment of the interaction between information and control in multi …

Сохранить Цитировать Цитируется: 386 Похожие статьи Все версии статьи (8) Поиск в библиотеках

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

[КНИГА][B] Control techniques for complex networks

S Meyn - 2008 - books.google.com

Power grids, flexible manufacturing, cellular communications: interconnectedness has
consequences. This remarkable book gives the tools and philosophy you need to build …

Сохранить Цитировать Цитируется: 689 Похожие статьи Все версии статьи (13) Поиск в библиотеках

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

[PDF][PDF] Scheduling for multiple flows sharing a time-varying channel: The exponential rule

S Shakkottai, AL Stolyar - Translations of the American …, 2002 - researchgate.net

We consider the following queueing system which arises as a model of a wireless link
shared by multiple users. Multiple ows must be served by a\channel"(server). The channel …

Сохранить Цитировать Цитируется: 515 Похожие статьи Все версии статьи (10) В виде HTML

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Stability and convergence of moments for multiclass queueing networks via fluid limit models

Reinforcement learning: A tutorial survey and recent advances

[КНИГА][B] Control systems and reinforcement learning

[КНИГА][B] Markov chains: Basic definitions

Operational Research: methods and applications

[КНИГА][B] Markov chains and stochastic stability

The ODE method for convergence of stochastic approximation and reinforcement learning

On positive Harris recurrence of multiclass queueing networks: a unified approach via fluid limit models

Stochastic networked control systems

[КНИГА][B] Control techniques for complex networks

[PDF][PDF] Scheduling for multiple flows sharing a time-varying channel: The exponential rule