- Academic Search

[BOOK] Neural networks and deep learning

CC Aggarwal - 2018 - Springer

“Any AI smart enough to pass a Turing test is smart enough to know to fail it.” – Ian
McDonald. Neural networks were developed to simulate the human nervous system for …

Cited by 3926

[PDF] mdpi.com

Recent advances in deep reinforcement learning applications for solving partially observable markov decision processes (pomdp) problems: Part 1—fundamentals …

X Xiang, S Foo - Machine Learning and Knowledge Extraction, 2021 - mdpi.com

The first part of a two-part series of papers provides a survey on recent advances in Deep
Reinforcement Learning (DRL) applications for solving partially observable Markov decision …

Cited by 59

[HTML] sciencedirect.com

[HTML] The Hanabi challenge: A new frontier for AI research

N Bard, JN Foerster, S Chandar, N Burch, M Lanctot… - Artificial Intelligence, 2020 - Elsevier

From the early days of computing, games have been important testbeds for studying how
well machines can do sophisticated decision making. In recent years, machine learning has …

Cited by 456

[PDF] nowpublishers.com

Bayesian reinforcement learning: A survey

M Ghavamzadeh, S Mannor, J Pineau… - … and Trends® in …, 2015 - nowpublishers.com

Bayesian methods for machine learning have been widely investigated, yielding principled
methods for incorporating prior information into inference algorithms. In this survey, we …

Cited by 593

[PDF] neurips.cc

Incremental natural actor-critic algorithms

S Bhatnagar, M Ghavamzadeh… - Advances in neural …, 2007 - proceedings.neurips.cc

We present four new reinforcement learning algorithms based on actor-critic and
natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning …

Cited by 1115

[PDF] academia.edu

[PDF] Intelligent traffic light control

M Wiering, J Van Veenen, J Vreeken… - Institute of Information …, 2004 - academia.edu

Vehicular travel is increasing throughout the world, particularly in large urban areas.
Therefore the need arises for simulating and optimizing traffic control algorithms to better …

Cited by 301

[PDF] sciencedirect.com

Programming backgammon using self-teaching neural nets

G Tesauro - Artificial Intelligence, 2002 - Elsevier

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by
playing against itself and learning from the results. Starting from random initial play, TD …

Cited by 317

[PDF] mlr.press

Learning to search with MCTSnets

A Guez, T Weber, I Antonoglou… - International …, 2018 - proceedings.mlr.press

Planning problems are among the most important and well-studied problems in artificial
intelligence. They are most typically solved by tree search algorithms that simulate ahead …

Cited by 100

[PDF] tu-chemnitz.de

TD-Gammon: A self-teaching backgammon program

G Tesauro - Applications of neural networks, 1995 - Springer

Furthermore, when a set of hand-crafted features is added to the network's input
representation, the result is a truly staggering level of performance: TD-Gammon is now …

Cited by 151

[PDF] ieee.org

Modern value based reinforcement learning: A chronological review

MC McKenzie, MD McDonnell - IEEE Access, 2022 - ieeexplore.ieee.org

Investigation of value based Reinforcement Learning algorithms exhibited a resurgence into
mainstream research in 2015 following demonstration of super-human performance when …

Cited by 9

Knightcap: a chess program that learns by combining td (lambda) with game-tree search

[BOOK] Neural networks and deep learning
