Google Učenjak

S Levine, A Kumar, G Tucker, J Fu - arxiv preprint arxiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Shrani Navedi Navedeno v 2227 virih Sorodni članki Vse različice: 3 V obliki HTML

Path planning and obstacle avoidance for AUV: A review

C Cheng, Q Sha, B He, G Li - Ocean Engineering, 2021 - Elsevier

Autonomous underwater vehicle plays a more and more important role in the exploration of
marine resources. Path planning and obstacle avoidance is the core technology to realize …

Shrani Navedi Navedeno v 248 virih Sorodni članki Vse različice: 3

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Offline reinforcement learning as one big sequence modeling problem

M Janner, Q Li, S Levine - Advances in neural information …, 2021 - proceedings.neurips.cc

Reinforcement learning (RL) is typically viewed as the problem of estimating single-step
policies (for model-free RL) or single-step models (for model-based RL), leveraging the …

Shrani Navedi Navedeno v 816 virih Sorodni članki Vse različice: 9 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Mastering atari, go, chess and shogi by planning with a learned model

J Schrittwieser, I Antonoglou, T Hubert, K Simonyan… - Nature, 2020 - nature.com

Constructing agents with planning capabilities has long been one of the main challenges in
the pursuit of artificial intelligence. Tree-based planning methods have enjoyed huge …

Shrani Navedi Navedeno v 2755 virih Sorodni članki Vse različice: 16

[Free GPT-4]
[DeepSeek]

[PDF] nowpublishers.com

[NAVEDBA][C] An introduction to variational autoencoders

DP Kingma, M Welling - Foundations and Trends® in …, 2019 - nowpublishers.com

An Introduction to Variational Autoencoders Page 1 An Introduction to Variational Autoencoders
Page 2 Other titles in Foundations and Trends R in Machine Learning Computational Optimal …

Shrani Navedi Navedeno v 3392 virih Sorodni članki Vse različice: 11 Iskanje knjižnic V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer

Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

Shrani Navedi Navedeno v 134 virih Sorodni članki Vse različice: 6

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

When to trust your model: Model-based policy optimization

M Janner, J Fu, M Zhang… - Advances in neural …, 2019 - proceedings.neurips.cc

Designing effective model-based reinforcement learning algorithms is difficult because the
ease of data generation must be weighed against the bias of model-generated data. In this …

Shrani Navedi Navedeno v 1131 virih Sorodni članki Vse različice: 10 V obliki HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] dm_control: Software and tasks for continuous control

S Tunyasuvunakool, A Muldal, Y Doron, S Liu, S Bohez… - Software Impacts, 2020 - Elsevier

The dm_control software package is a collection of Python libraries and task suites for
reinforcement learning agents in an articulated-body simulation. Infrastructure includes a …

Shrani Navedi Navedeno v 422 virih Sorodni članki Vse različice: 5

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Soft actor-critic algorithms and applications

T Haarnoja, A Zhou, K Hartikainen, G Tucker… - arxiv preprint arxiv …, 2018 - arxiv.org

Model-free deep reinforcement learning (RL) algorithms have been successfully applied to a
range of challenging sequential decision making and control tasks. However, these methods …

Shrani Navedi Navedeno v 3293 virih Sorodni članki Vse različice: 4 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] nowpublishers.com

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com

Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

Shrani Navedi Navedeno v 947 virih Sorodni članki Vse različice: 15 Iskanje knjižnic V obliki HTML

Ustvari opozorilo

Navedi

Napredno iskanje

Shranjeno v Mojo knjižnico

Learning continuous control policies by stochastic value gradients

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

Path planning and obstacle avoidance for AUV: A review

Offline reinforcement learning as one big sequence modeling problem

Mastering atari, go, chess and shogi by planning with a learned model

[NAVEDBA][C] An introduction to variational autoencoders

A survey on model-based reinforcement learning

When to trust your model: Model-based policy optimization

[HTML][HTML] dm_control: Software and tasks for continuous control

Soft actor-critic algorithms and applications

Model-based reinforcement learning: A survey