Google Učenjak

Data-driven robotic manipulation of cloth-like deformable objects: The present, challenges and future prospects

HA Kadi, K Terzić - Sensors, 2023 - mdpi.com

Manipulating cloth-like deformable objects (CDOs) is a long-standing problem in the
robotics community. CDOs are flexible (non-rigid) objects that do not show a detectable level …

Shrani Navedi Navedeno v 11 virih Sorodni članki Vse različice: 16 Posnetek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning to locomote: Understanding how environment design matters for deep reinforcement learning

D Reda, T Tao, M van de Panne - Proceedings of the 13th ACM …, 2020 - dl.acm.org

Learning to locomote is one of the most common tasks in physics-based animation and
deep reinforcement learning (RL). A learned policy is the product of the problem to be …

Shrani Navedi Navedeno v 74 virih Sorodni članki Vse različice: 4

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Learning to configure separators in branch-and-cut

S Li, W Ouyang, M Paulus… - Advances in Neural …, 2023 - proceedings.neurips.cc

Cutting planes are crucial in solving mixed integer linear programs (MILP) as they facilitate
bound improvements on the optimal solution. Modern MILP solvers rely on a variety of …

Shrani Navedi Navedeno v 12 virih Sorodni članki Vse različice: 7 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Temporl: Learning when to act

A Biedenkapp, R Rajan, F Hutter… - … on Machine Learning, 2021 - proceedings.mlr.press

Reinforcement learning is a powerful approach to learn behaviour through interactions with
an environment. However, behaviours are usually learned in a purely reactive fashion …

Shrani Navedi Navedeno v 41 virih Sorodni članki Vse različice: 12 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Taac: Temporally abstract actor-critic for continuous control

H Yu, W Xu, H Zhang - Advances in neural information …, 2021 - proceedings.neurips.cc

We present temporally abstract actor-critic (TAAC), a simple but effective off-policy RL
algorithm that incorporates closed-loop temporal abstraction into the actor-critic framework …

Shrani Navedi Navedeno v 32 virih Sorodni članki Vse različice: 8 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Time discretization-invariant safe action repetition for policy gradient methods

S Park, J Kim, G Kim - Advances in Neural Information …, 2021 - proceedings.neurips.cc

In reinforcement learning, continuous time is often discretized by a time scale $\delta $, to
which the resulting performance is known to be highly sensitive. In this work, we seek to find …

Shrani Navedi Navedeno v 27 virih Sorodni članki Vse različice: 7 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

No-regret reinforcement learning in smooth mdps

D Maran, AM Metelli, M Papini, M Restell - arxiv preprint arxiv:2402.03792, 2024 - arxiv.org

Obtaining no-regret guarantees for reinforcement learning (RL) in the case of problems with
continuous state and/or action spaces is still one of the major open challenges in the field …

Shrani Navedi Navedeno v 7 virih Sorodni članki Vse različice: 7 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] oapen.org

[PDF][PDF] Configurable environments in reinforcement learning: An overview

AM Metelli - Special Topics in Information Technology, 2022 - library.oapen.org

Reinforcement Learning (RL) has emerged as an effective approach to address a variety of
complex control tasks. In a typical RL problem, an agent interacts with the environment by …

Shrani Navedi Navedeno v 8 virih Sorodni članki Vse različice: 9 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] polimi.it

Addressing non-stationarity in fx trading with online model selection of offline rl experts

A Riva, L Bisi, P Liotet, L Sabbioni, E Vittori… - Proceedings of the …, 2022 - dl.acm.org

Reinforcement learning has proven to be successful in obtaining profitable trading policies;
however, the effectiveness of such strategies is strongly conditioned to market stationarity …

Shrani Navedi Navedeno v 13 virih Sorodni članki Vse različice: 5

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Addressing action oscillations through learning policy inertia

C Chen, H Tang, J Hao, W Liu, Z Meng - Proceedings of the AAAI …, 2021 - ojs.aaai.org

Deep reinforcement learning (DRL) algorithms have been demonstrated to be effective on a
wide range of challenging decision making and control tasks. However, these methods …

Shrani Navedi Navedeno v 21 virih Sorodni članki Vse različice: 5 V obliki HTML

Ustvari opozorilo

Navedi

Napredno iskanje

Shranjeno v Mojo knjižnico

Control frequency adaptation via action persistence in batch reinforcement learning

Data-driven robotic manipulation of cloth-like deformable objects: The present, challenges and future prospects

Learning to locomote: Understanding how environment design matters for deep reinforcement learning

Learning to configure separators in branch-and-cut

Temporl: Learning when to act

Taac: Temporally abstract actor-critic for continuous control

Time discretization-invariant safe action repetition for policy gradient methods

No-regret reinforcement learning in smooth mdps

[PDF][PDF] Configurable environments in reinforcement learning: An overview

Addressing non-stationarity in fx trading with online model selection of offline rl experts

Addressing action oscillations through learning policy inertia