Deep learning in neural networks: An overview
J Schmidhuber - Neural networks, 2015 - Elsevier
In recent years, deep artificial neural networks (including recurrent ones) have won
numerous contests in pattern recognition and machine learning. This historical survey …
numerous contests in pattern recognition and machine learning. This historical survey …
Reinforcement learning: A tutorial survey and recent advances
A Gosavi - INFORMS Journal on Computing, 2009 - pubsonline.informs.org
In the last few years, reinforcement learning (RL), also called adaptive (or approximate)
dynamic programming, has emerged as a powerful tool for solving complex sequential …
dynamic programming, has emerged as a powerful tool for solving complex sequential …
Stochastic Mechanics Applications of
A Board - 2003 - Springer
The original work in recursive stochastic algorithms was by Robbins and Monro, who
developed and analyzed a recursive procedure for finding the root of a real-valued function …
developed and analyzed a recursive procedure for finding the root of a real-valued function …
Fully decentralized multi-agent reinforcement learning with networked agents
We consider the fully decentralized multi-agent reinforcement learning (MARL) problem,
where the agents are connected via a time-varying and possibly sparse communication …
where the agents are connected via a time-varying and possibly sparse communication …
[BOOK][B] Stochastic approximation: a dynamical systems viewpoint
VS Borkar, VS Borkar - 2008 - Springer
Stochastic approximation was introduced in a 1951 article in the Annals of Mathematical
Statistics by Robbins and Monro. Originally conceived as a tool for statistical computation …
Statistics by Robbins and Monro. Originally conceived as a tool for statistical computation …
Incremental natural actor-critic algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-
gradient ideas, and provide their convergence proofs. Actor-critic rein-forcement learning …
gradient ideas, and provide their convergence proofs. Actor-critic rein-forcement learning …
[BOOK][B] Simulation-based optimization
A Gosavi - 2015 - Springer
This book is written for students and researchers in the field of industrial engineering,
computer science, operations research, management science, electrical engineering, and …
computer science, operations research, management science, electrical engineering, and …
The ODE method for convergence of stochastic approximation and reinforcement learning
It is shown here that stability of the stochastic approximation algorithm is implied by the
asymptotic stability of the origin for an associated ODE. This in turn implies convergence of …
asymptotic stability of the origin for an associated ODE. This in turn implies convergence of …
Joint status sampling and updating for minimizing age of information in the Internet of Things
The effective operation of time-critical Internet of things (IoT) applications requires real-time
reporting of fresh status information of underlying physical processes. In this paper, a real …
reporting of fresh status information of underlying physical processes. In this paper, a real …
Distributed constraint optimization problems and applications: A survey
The field of multi-agent system (MAS) is an active area of research within artificial
intelligence, with an increasingly important impact in industrial and other real-world …
intelligence, with an increasingly important impact in industrial and other real-world …