- Academic Search

E Bengio, J Pineau, D Precup - International Conference on …, 2020 - proceedings.mlr.press

We study the link between generalization and interference in temporal-difference (TD)
learning. Interference is defined as the inner product of two different gradients, representing …

Speichern Zitieren Zitiert von: 72 Ähnliche Artikel Alle 7 Versionen HTML-Version

[Free GPT-4]

[PDF] academia.edu

[PDF][PDF] State-of-the-art reinforcement learning algorithms

D Mehta - International Journal of Engineering Research and …, 2020 - academia.edu

This research paper brings together many different aspects of the current research on
several fields associated to Reinforcement Learning which has been growing rapidly …

Speichern Zitieren Zitiert von: 38 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

PBCS: Efficient Exploration and Exploitation Using a Synergy Between Reinforcement Learning and Motion Planning

G Matheron, N Perrin, O Sigaud - International Conference on Artificial …, 2020 - Springer

The exploration-exploitation trade-off is at the heart of reinforcement learning (RL). However,
most continuous control benchmarks used in recent RL research only require local …

Speichern Zitieren Zitiert von: 19 Ähnliche Artikel Alle 8 Versionen

[Free GPT-4]

[PDF] neurips.cc

Adaptive temporal-difference learning for policy evaluation with per-state uncertainty estimates

C Riquelme, H Penedones, D Vincent… - Advances in …, 2019 - proceedings.neurips.cc

We consider the core reinforcement-learning problem of on-policy value function
approximation from a batch of trajectory data, and focus on various issues of Temporal …

Speichern Zitieren Zitiert von: 10 Ähnliche Artikel Alle 10 Versionen HTML-Version

[Free GPT-4]

[PDF] mcgill.ca

[BUCH][B] Generalization, optimization, diverse generation: insights and advances in the use of bootstrap** in deep neural networks

E Bengio - 2022 - search.proquest.com

This thesis investigates the use of bootstrap** in Temporal Difference (TD) learning, a
central mechanism in reinforcement learning (RL), when applied to deep neural networks. I …

Speichern Zitieren Ähnliche Artikel Alle 3 Versionen Bibliothekssuche

[Free GPT-4]

[PDF] cyberleninka.ru Full View

Использование нейронных сетей для решения игровых задач на примере задачи поиска пути в лабиринте

ДО Романников, АА Воевода - … , вычислительная техника и …, 2018 - cyberleninka.ru

Рассматривается решение игровых задач на примере задачи поиска пути в лабиринте
при помощи нейронной сети. Такая задача может быть решена одним из …

Speichern Zitieren Zitiert von: 4 Ähnliche Artikel Alle 3 Versionen Im Cache

[Free GPT-4]

[PDF] hal.science

Unsupervised Pretraining of State Representations in a Rewardless Environment

A Merckling - 2021 - theses.hal.science

This thesis seeks to extend the capabilities of state representation learning (SRL) to help
scale deep reinforcement learning (DRL) algorithms to continuous control tasks with high …

Speichern Zitieren Zitiert von: 1 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] hal.science

Integrating motion planning into reinforcement learning to solve hard exploration problems

G Matheron - 2020 - theses.hal.science

Motion planning is able to solve robotics problems much quicker than any reinforcement
learning algorithm by efficiently searching for a viable trajectory. Indeed, while the main …

Speichern Zitieren Ähnliche Artikel Alle 6 Versionen Bibliothekssuche HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Temporal difference learning with neural networks-study of the leakage propagation problem

Interference and generalization in temporal difference learning

[PDF][PDF] State-of-the-art reinforcement learning algorithms

PBCS: Efficient Exploration and Exploitation Using a Synergy Between Reinforcement Learning and Motion Planning

Adaptive temporal-difference learning for policy evaluation with per-state uncertainty estimates

[BUCH][B] Generalization, optimization, diverse generation: insights and advances in the use of bootstrap** in deep neural networks

Использование нейронных сетей для решения игровых задач на примере задачи поиска пути в лабиринте

Unsupervised Pretraining of State Representations in a Rewardless Environment

Integrating motion planning into reinforcement learning to solve hard exploration problems