Exploration in deep reinforcement learning: A survey

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

Decoupling exploration and exploitation for meta-reinforcement learning without sacrifices

EZ Liu, A Raghunathan, P Liang… - … conference on machine …, 2021 - proceedings.mlr.press
The goal of meta-reinforcement learning (meta-RL) is to build agents that can quickly learn
new tasks by leveraging prior experience on related tasks. Learning a new task often …

Learning options via compression

Y Jiang, E Liu, B Eysenbach… - Advances in Neural …, 2022 - proceedings.neurips.cc
Identifying statistical regularities in solutions to some tasks in multi-task reinforcement
learning can accelerate the learning of new tasks. Skill learning offers one way of identifying …

Giving feedback on interactive student programs with meta-exploration

E Liu, M Stephan, A Nie, C Piech… - Advances in Neural …, 2022 - proceedings.neurips.cc
Develo** interactive software, such as websites or games, is a particularly engaging way
to learn computer science. However, teaching and giving feedback on such software is time …

D2SR: transferring dense reward function to sparse by network resetting

Y Luo, Y Wang, K Dong, Y Liu, Z Sun… - … Conference on Real …, 2023 - ieeexplore.ieee.org
In Reinforcement Learning (RL), most algorithms use a fixed reward function, and few
studies discuss transferring the reward function during learning. Actually, different types of …

[BOK][B] Meta-Reinforcement Learning: Algorithms and Applications

EZ Liu - 2023 - search.proquest.com
Reinforcement learning from scratch often requires a tremendous number of samples to
learn complex tasks, but many real-world applications demand learning from only a few …

Sample-Efficient Algorithms for Hard-Exploration Problems in Reinforcement Learning

Y Guo - 2022 - deepblue.lib.umich.edu
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize
cumulative rewards through trial-and-error interactions with dynamic environments. In recent …

Abstracción en aprendizaje por refuerzo: enfoques y aplicación demostrativa

A Francisco Toral - 2021 - oa.upm.es
La mente es lo que diferencia a los seres humanos del resto de animales. Esta capacidad
de razonar, dialogar, pensar y sentir sólo ha sido encontrada en nosotros mismos. Estas …