Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

Decision-theoretic planning: Structural assumptions and computational leverage

C Boutilier, T Dean, S Hanks - Journal of Artificial Intelligence Research, 1999 - jair.org
Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …

[책][B] Planning algorithms

SM LaValle - 2006 - books.google.com
Planning algorithms are impacting technical disciplines and industries around the world,
including robotics, computer-aided design, manufacturing, computer graphics, aerospace …

Reinforcement learning: An introduction

RS Sutton - A Bradford Book, 2018 - books.google.com
The significantly expanded and updated new edition of a widely used text on reinforcement
learning, one of the most active research areas in artificial intelligence. Reinforcement …

Machine-learning research

TG Dietterich - AI magazine, 1997 - ojs.aaai.org
Abstract Machine-learning research has been making great progress in many directions.
This article summarizes four of these directions and discusses some current open problems …

[PDF][PDF] Learning agents for uncertain environments

S Russell - Proceedings of the eleventh annual conference on …, 1998 - dl.acm.org
This talk proposes a very simple “baseline architecture” for a learning agent that can handle
stochastic, partially observable environments. The architecture uses reinforcement learning …

Learning policies for partially observable environments: Scaling up

ML Littman, AR Cassandra, LP Kaelbling - Machine Learning Proceedings …, 1995 - Elsevier
Partially observable Markov decision processes (POMDP's) model decision problems in
which an agent tries to maximize its reward in the face of limited and/or noisy sensor …

Deep reinforcement learning with its application for lung cancer detection in medical Internet of Things

Z Liu, C Yao, H Yu, T Wu - Future Generation Computer Systems, 2019 - Elsevier
Recently, deep reinforcement learning has achieved great success by integrating deep
learning models into reinforcement learning algorithms in various applications such as …

Value-function approximations for partially observable Markov decision processes

M Hauskrecht - Journal of artificial intelligence research, 2000 - jair.org
Partially observable Markov decision processes (POMDPs) provide an elegant
mathematical framework for modeling complex decision and planning problems in …

Partially observable Markov decision processes

MTJ Spaan - Reinforcement learning: State-of-the-art, 2012 - Springer
For reinforcement learning in environments in which an agent has access to a reliable state
signal, methods based on the Markov decision process (MDP) have had many successes. In …