- Academic Search

Deep learning in neural networks: An overview

J Schmidhuber - Neural networks, 2015 - Elsevier

In recent years, deep artificial neural networks (including recurrent ones) have won
numerous contests in pattern recognition and machine learning. This historical survey …

Cited by 24074

[HTML] acm.org

Reinforcement learning: A tutorial survey and recent advances

A Gosavi - INFORMS Journal on Computing, 2009 - pubsonline.informs.org

In the last few years, reinforcement learning (RL), also called adaptive (or approximate)
dynamic programming, has emerged as a powerful tool for solving complex sequential …

Cited by 453

Stochastic Mechanics Applications of

A Board - 2003 - Springer

The original work in recursive stochastic algorithms was by Robbins and Monro, who
developed and analyzed a recursive procedure for finding the root of a real-valued function …

Cited by 4811

[PDF] mlr.press

Fully decentralized multi-agent reinforcement learning with networked agents

K Zhang, Z Yang, H Liu, T Zhang… - … conference on machine …, 2018 - proceedings.mlr.press

We consider the fully decentralized multi-agent reinforcement learning (MARL) problem,
where the agents are connected via a time-varying and possibly sparse communication …

Cited by 739

[BOOK][B] Stochastic approximation: a dynamical systems viewpoint

VS Borkar - 2008 - Springer

Stochastic approximation was introduced in a 1951 article in the Annals of Mathematical
Statistics by Robbins and Monro. Originally conceived as a tool for statistical computation …

Cited by 1979

[PDF] neurips.cc

Incremental natural actor-critic algorithms

S Bhatnagar, M Ghavamzadeh… - Advances in neural …, 2007 - proceedings.neurips.cc

We present four new reinforcement learning algorithms based on actor-critic and
natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning …

Cited by 1115

[PDF] academia.edu

[BOOK][B] Simulation-based optimization

A Gosavi - 2015 - Springer

This book is written for students and researchers in the field of industrial engineering,
computer science, operations research, management science, electrical engineering, and …

Cited by 848

[PDF] ias.ac.in

The ODE method for convergence of stochastic approximation and reinforcement learning

VS Borkar, SP Meyn - SIAM Journal on Control and Optimization, 2000 - SIAM

It is shown here that stability of the stochastic approximation algorithm is implied by the
asymptotic stability of the origin for an associated ODE. This in turn implies convergence of …

Cited by 691

[PDF] ieee.org

Joint status sampling and updating for minimizing age of information in the Internet of Things

B Zhou, W Saad - IEEE Transactions on Communications, 2019 - ieeexplore.ieee.org

The effective operation of time-critical Internet of things (IoT) applications requires real-time
reporting of fresh status information of underlying physical processes. In this paper, a real …

Cited by 227

[PDF] jair.org

Distributed constraint optimization problems and applications: A survey

F Fioretto, E Pontelli, W Yeoh - Journal of Artificial Intelligence Research, 2018 - jair.org

The field of multi-agent system (MAS) is an active area of research within artificial
intelligence, with an increasingly important impact in industrial and other real-world …

Cited by 288

Learning algorithms for Markov decision processes with average cost

Deep learning in neural networks: An overview
