- Academic Search

Reinforcement learning algorithms: A brief survey

AK Shakya, G Pillai, S Chakrabarty - Expert Systems with Applications, 2023 - Elsevier

Reinforcement Learning (RL) is a machine learning (ML) technique to learn sequential
decision-making in complex problems. RL is inspired by trial-and-error based human/animal …

Cited by: 213 | All 2 versions

Explainable reinforcement learning in production control of job shop manufacturing system

A Kuhnle, MC May, L Schäfer… - International Journal of …, 2022 - Taylor & Francis

Manufacturing in the age of Industry 4.0 can be characterised by a high product variety and
complex material flows. The increasing individualisation of products requires adaptive …

Cited by: 55 | All 4 versions

[PDF] neurips.cc

Uncertainty-based offline reinforcement learning with diversified q-ensemble

G An, S Moon, JH Kim… - Advances in neural …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (offline RL), which aims to find an optimal policy from a
previously collected static dataset, bears algorithmic difficulties due to function …

Cited by: 309 | All 7 versions

[PDF] arxiv.org

Randomized ensembled double q-learning: Learning fast without a model

X Chen, C Wang, Z Zhou, K Ross - arXiv …

Exploit reward shifting in value-based deep-RL: Optimistic curiosity-based exploration and conservative exploitation via linear reward shaping

H Sun, L Han, R Yang, X Ma… - Advances in neural …, 2022 - proceedings.neurips.cc

In this work, we study the simple yet universally applicable case of reward shaping in
value-based Deep Reinforcement Learning (DRL). We show that reward shifting in the form of a …

Cited by: 25 | All 3 versions

[PDF] arxiv.org

Exploration in deep reinforcement learning: From single-agent to multiagent domain

J Hao, T Yang, H Tang, C Bai, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL)
have achieved significant success across a wide range of domains, including game artificial …

Cited by: 121 | All 7 versions

Relmogen: Integrating motion generation in reinforcement learning for mobile manipulation

F Xia, C Li, R Martín-Martín, O Litany… - … on Robotics and …, 2021 - ieeexplore.ieee.org

Many Reinforcement Learning (RL) approaches use joint control signals (positions,
velocities, torques) as action space for continuous control tasks. We propose to lift the action …

Cited by: 74 | All 3 versions

[PDF] arxiv.org

Thompson sampling for improved exploration in gflownets

J Rector-Brooks, K Madan, M Jain, M Korablyov… - arXiv preprint arXiv …, 2023 - arxiv.org

Generative flow networks (GFlowNets) are amortized variational inference algorithms that
treat sampling from a distribution over compositional objects as a sequential decision …

Cited by: 20 | All 3 versions