Reinforcement learning algorithms: A brief survey

AK Shakya, G Pillai, S Chakrabarty - Expert Systems with Applications, 2023 - Elsevier
Reinforcement Learning (RL) is a machine learning (ML) technique to learn sequential
decision-making in complex problems. RL is inspired by trial-and-error based human/animal …

Explainable reinforcement learning in production control of job shop manufacturing system

A Kuhnle, MC May, L Schäfer… - International Journal of …, 2022 - Taylor & Francis
Manufacturing in the age of Industry 4.0 can be characterised by a high product variety and
complex material flows. The increasing individualisation of products requires adaptive …

Uncertainty-based offline reinforcement learning with diversified q-ensemble

G An, S Moon, JH Kim… - Advances in neural …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (offline RL), which aims to find an optimal policy from a
previously collected static dataset, bears algorithmic difficulties due to function …

Randomized ensembled double q-learning: Learning fast without a model

X Chen, C Wang, Z Zhou, K Ross - ar**
H Sun, L Han, R Yang, X Ma… - Advances in neural …, 2022 - proceedings.neurips.cc
In this work, we study the simple yet universally applicable case of reward sha** in value-
based Deep Reinforcement Learning (DRL). We show that reward shifting in the form of a …

Exploration in deep reinforcement learning: From single-agent to multiagent domain

J Hao, T Yang, H Tang, C Bai, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL)
have achieved significant success across a wide range of domains, including game artificial …

Relmogen: Integrating motion generation in reinforcement learning for mobile manipulation

F **a, C Li, R Martín-Martín, O Litany… - … on Robotics and …, 2021 - ieeexplore.ieee.org
Many Reinforcement Learning (RL) approaches use joint control signals (positions,
velocities, torques) as action space for continuous control tasks. We propose to lift the action …

Thompson sampling for improved exploration in gflownets

J Rector-Brooks, K Madan, M Jain, M Korablyov… - arxiv preprint arxiv …, 2023 - arxiv.org
Generative flow networks (GFlowNets) are amortized variational inference algorithms that
treat sampling from a distribution over compositional objects as a sequential decision …