Multi-agent reinforcement learning: A selective overview of theories and algorithms
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …
has registered tremendous success in solving various sequential decision-making problems …
Decision-theoretic planning: Structural assumptions and computational leverage
Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …
decision making, and has been addressed by researchers in many different fields, including …
[책][B] Planning algorithms
SM LaValle - 2006 - books.google.com
Planning algorithms are impacting technical disciplines and industries around the world,
including robotics, computer-aided design, manufacturing, computer graphics, aerospace …
including robotics, computer-aided design, manufacturing, computer graphics, aerospace …
Machine-learning research
TG Dietterich - AI magazine, 1997 - ojs.aaai.org
Abstract Machine-learning research has been making great progress in many directions.
This article summarizes four of these directions and discusses some current open problems …
This article summarizes four of these directions and discusses some current open problems …
[PDF][PDF] Learning agents for uncertain environments
S Russell - Proceedings of the eleventh annual conference on …, 1998 - dl.acm.org
This talk proposes a very simple “baseline architecture” for a learning agent that can handle
stochastic, partially observable environments. The architecture uses reinforcement learning …
stochastic, partially observable environments. The architecture uses reinforcement learning …
Learning policies for partially observable environments: Scaling up
Partially observable Markov decision processes (POMDP's) model decision problems in
which an agent tries to maximize its reward in the face of limited and/or noisy sensor …
which an agent tries to maximize its reward in the face of limited and/or noisy sensor …
Deep reinforcement learning with its application for lung cancer detection in medical Internet of Things
Z Liu, C Yao, H Yu, T Wu - Future Generation Computer Systems, 2019 - Elsevier
Recently, deep reinforcement learning has achieved great success by integrating deep
learning models into reinforcement learning algorithms in various applications such as …
learning models into reinforcement learning algorithms in various applications such as …
Value-function approximations for partially observable Markov decision processes
M Hauskrecht - Journal of artificial intelligence research, 2000 - jair.org
Partially observable Markov decision processes (POMDPs) provide an elegant
mathematical framework for modeling complex decision and planning problems in …
mathematical framework for modeling complex decision and planning problems in …
Partially observable Markov decision processes
MTJ Spaan - Reinforcement learning: State-of-the-art, 2012 - Springer
For reinforcement learning in environments in which an agent has access to a reliable state
signal, methods based on the Markov decision process (MDP) have had many successes. In …
signal, methods based on the Markov decision process (MDP) have had many successes. In …