Reinforcement learning approaches in social robotics

N Akalin, A Loutfi - Sensors, 2021 - mdpi.com
This article surveys reinforcement learning approaches in social robotics. Reinforcement
learning is a framework for decision-making problems in which an agent interacts through …

Cooperative multi-agent learning: The state of the art

L Panait, S Luke - Autonomous agents and multi-agent systems, 2005 - Springer
Cooperative multi-agent systems (MAS) are ones in which several agents attempt, through
their interaction, to jointly solve tasks or to maximize utility. Due to the interactions among the …

[LIVRE][B] Control systems and reinforcement learning

S Meyn - 2022 - books.google.com
A high school student can create deep Q-learning code to control her robot, without any
understanding of the meaning of'deep'or'Q', or why the code sometimes fails. This book is …

Simple statistical gradient-following algorithms for connectionist reinforcement learning

RJ Williams - Machine learning, 1992 - Springer
This article presents a general class of associative reinforcement learning algorithms for
connectionist networks containing stochastic units. These algorithms, called REINFORCE …

[LIVRE][B] Planning algorithms

SM LaValle - 2006 - books.google.com
Planning algorithms are impacting technical disciplines and industries around the world,
including robotics, computer-aided design, manufacturing, computer graphics, aerospace …

[LIVRE][B] Theoretical neuroscience: computational and mathematical modeling of neural systems

P Dayan, LF Abbott - 2005 - books.google.com
Theoretical neuroscience provides a quantitative basis for describing what nervous systems
do, determining how they function, and uncovering the general principles by which they …

Markov games as a framework for multi-agent reinforcement learning

ML Littman - Machine learning proceedings 1994, 1994 - Elsevier
In the Markov decision process (MDP) formalization of reinforcement learning, a single
adaptive agent interacts with an environment defined by a probabilistic transition function. In …

Self-improving reactive agents based on reinforcement learning, planning and teaching

LJ Lin - Machine learning, 1992 - Springer
To date, reinforcement learning has mostly been studied solving simple learning tasks.
Reinforcement learning methods that have been studied so far typically converge slowly …

[LIVRE][B] Introduction to the theory of neural computation

JA Hertz - 2018 - taylorfrancis.com
INTRODUCTION TO THE THEORY OF NEURAL COMPUTATION Page 1 Page 2
INTRODUCTION TO THE THEORY OF NEURAL COMPUTATION Page 3 Page 4 …

Integrated architectures for learning, planning, and reacting based on approximating dynamic programming

RS Sutton - Machine learning proceedings 1990, 1990 - Elsevier
This paper extends previous work with Dyna, a class of architectures for intelligent systems
based on approximating dynamic programming methods. Dyna architectures integrate trial …