Exploration in deep reinforcement learning: A survey
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …
Formalizing planning and information search in naturalistic decision-making
Decisions made by mammals and birds are often temporally extended. They require
planning and sampling of decision-relevant information. Our understanding of such decision …
Exploration by random network distillation
We introduce an exploration bonus for deep reinforcement learning methods that is easy to
implement and adds minimal overhead to the computation performed. The bonus is the error …
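The snippet above is cut off, but the bonus this paper introduces is the error of a predictor network trained to match the output of a fixed, randomly initialized target network on visited observations: unfamiliar observations produce large errors and thus large intrinsic rewards. A minimal sketch of that idea follows; the PyTorch setup, network sizes, and class names are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch of a random-network-distillation-style bonus.
# Network sizes and names are assumptions, not taken from the paper.
import torch
import torch.nn as nn


def make_net(obs_dim: int, feat_dim: int) -> nn.Module:
    return nn.Sequential(nn.Linear(obs_dim, 128), nn.ReLU(), nn.Linear(128, feat_dim))


class RNDBonus:
    def __init__(self, obs_dim: int, feat_dim: int = 64, lr: float = 1e-4):
        self.target = make_net(obs_dim, feat_dim)     # fixed, randomly initialized
        self.predictor = make_net(obs_dim, feat_dim)  # trained to imitate the target
        for p in self.target.parameters():
            p.requires_grad_(False)
        self.opt = torch.optim.Adam(self.predictor.parameters(), lr=lr)

    def bonus(self, obs: torch.Tensor) -> torch.Tensor:
        # Intrinsic reward: per-observation prediction error, which stays high
        # for observations unlike those the predictor has already been trained on.
        with torch.no_grad():
            target_feat = self.target(obs)
        return (self.predictor(obs) - target_feat).pow(2).mean(dim=-1).detach()

    def update(self, obs: torch.Tensor) -> None:
        # Distill the fixed target into the predictor on visited observations,
        # so familiar states gradually receive a smaller bonus.
        loss = (self.predictor(obs) - self.target(obs).detach()).pow(2).mean()
        self.opt.zero_grad()
        loss.backward()
        self.opt.step()
```

In use, the agent would add this bonus (optionally normalized) to the extrinsic reward at each step and call `update` on batches of observations collected during training.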
Behavior from the void: Unsupervised active pre-training
We introduce a new unsupervised pre-training method for reinforcement learning called
APT, which stands for Active Pre-Training. APT learns behaviors and representations by …
APS: Active pretraining with successor features
We introduce a new unsupervised pretraining objective for reinforcement learning. During
the unsupervised reward-free pretraining phase, the agent maximizes mutual information …
Toward human-in-the-loop AI: Enhancing deep reinforcement learning via real-time human guidance for autonomous driving
Due to its limited intelligence and abilities, machine learning is currently unable to handle
various situations and thus cannot completely replace humans in real-world applications …
Semantic exploration from language abstractions and pretrained representations
Effective exploration is a challenge in reinforcement learning (RL). Novelty-based
exploration methods can suffer in high-dimensional state spaces, such as continuous …
Efficient reinforcement learning in block mdps: A model-free representation learning approach
We present BRIEE, an algorithm for efficient reinforcement learning in Markov Decision
Processes with block-structured dynamics (i.e., Block MDPs), where rich observations are …
NovelD: A simple yet effective exploration criterion
Efficient exploration under sparse rewards remains a key challenge in deep reinforcement
learning. Previous exploration methods (e.g., RND) have achieved strong results in multiple …
Exploration via elliptical episodic bonuses
In recent years, a number of reinforcement learning (RL) methods have been proposed to
explore complex environments which differ across episodes. In this work, we show that the …