Google 학술 검색

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

저장 인용 1716회 인용 관련 학술자료 전체 8개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] jair.org

Decision-theoretic planning: Structural assumptions and computational leverage

C Boutilier, T Dean, S Hanks - Journal of Artificial Intelligence Research, 1999 - jair.org

Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

[책][B] Planning algorithms

SM LaValle - 2006 - books.google.com

Planning algorithms are impacting technical disciplines and industries around the world,
including robotics, computer-aided design, manufacturing, computer graphics, aerospace …

[Free GPT-4]
[DeepSeek]

[PDF] umbc.edu

Reinforcement learning: An introduction

RS Sutton - A Bradford Book, 2018 - books.google.com

The significantly expanded and updated new edition of a widely used text on reinforcement
learning, one of the most active research areas in artificial intelligence. Reinforcement …

저장 인용 78915회 인용 관련 학술자료

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org Full View

Machine-learning research

TG Dietterich - AI magazine, 1997 - ojs.aaai.org

Abstract Machine-learning research has been making great progress in many directions.
This article summarizes four of these directions and discusses some current open problems …

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

[PDF][PDF] Learning agents for uncertain environments

S Russell - Proceedings of the eleventh annual conference on …, 1998 - dl.acm.org

This talk proposes a very simple “baseline architecture” for a learning agent that can handle
stochastic, partially observable environments. The architecture uses reinforcement learning …

저장 인용 756회 인용 관련 학술자료 전체 16개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] psu.edu

Learning policies for partially observable environments: Scaling up

ML Littman, AR Cassandra, LP Kaelbling - Machine Learning Proceedings …, 1995 - Elsevier

Partially observable Markov decision processes (POMDP's) model decision problems in
which an agent tries to maximize its reward in the face of limited and/or noisy sensor …

저장 인용 1055회 인용 관련 학술자료 전체 16개의 버전

Deep reinforcement learning with its application for lung cancer detection in medical Internet of Things

Z Liu, C Yao, H Yu, T Wu - Future Generation Computer Systems, 2019 - Elsevier

Recently, deep reinforcement learning has achieved great success by integrating deep
learning models into reinforcement learning algorithms in various applications such as …

저장 인용 167회 인용 관련 학술자료 전체 3개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] jair.org

Value-function approximations for partially observable Markov decision processes

M Hauskrecht - Journal of artificial intelligence research, 2000 - jair.org

Partially observable Markov decision processes (POMDPs) provide an elegant
mathematical framework for modeling complex decision and planning problems in …

[Free GPT-4]
[DeepSeek]

[PDF] tudelft.nl

Partially observable Markov decision processes

MTJ Spaan - Reinforcement learning: State-of-the-art, 2012 - Springer

For reinforcement learning in environments in which an agent has access to a reliable state
signal, methods based on the Markov decision process (MDP) have had many successes. In …

저장 인용 461회 인용 관련 학술자료 전체 16개의 버전

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Approximating optimal policies for partially observable stochastic domains

Multi-agent reinforcement learning: A selective overview of theories and algorithms

Decision-theoretic planning: Structural assumptions and computational leverage

[책][B] Planning algorithms

Reinforcement learning: An introduction

Machine-learning research

[PDF][PDF] Learning agents for uncertain environments

Learning policies for partially observable environments: Scaling up

Deep reinforcement learning with its application for lung cancer detection in medical Internet of Things

Value-function approximations for partially observable Markov decision processes

Partially observable Markov decision processes