Data-driven robotic manipulation of cloth-like deformable objects: The present, challenges and future prospects

HA Kadi, K Terzić - Sensors, 2023 - mdpi.com
Manipulating cloth-like deformable objects (CDOs) is a long-standing problem in the
robotics community. CDOs are flexible (non-rigid) objects that do not show a detectable level …

Learning to locomote: Understanding how environment design matters for deep reinforcement learning

D Reda, T Tao, M van de Panne - Proceedings of the 13th ACM …, 2020 - dl.acm.org
Learning to locomote is one of the most common tasks in physics-based animation and
deep reinforcement learning (RL). A learned policy is the product of the problem to be …

Learning to configure separators in branch-and-cut

S Li, W Ouyang, M Paulus… - Advances in Neural …, 2023 - proceedings.neurips.cc
Cutting planes are crucial in solving mixed integer linear programs (MILP) as they facilitate
bound improvements on the optimal solution. Modern MILP solvers rely on a variety of …

Temporl: Learning when to act

A Biedenkapp, R Rajan, F Hutter… - … on Machine Learning, 2021 - proceedings.mlr.press
Reinforcement learning is a powerful approach to learn behaviour through interactions with
an environment. However, behaviours are usually learned in a purely reactive fashion …

Taac: Temporally abstract actor-critic for continuous control

H Yu, W Xu, H Zhang - Advances in neural information …, 2021 - proceedings.neurips.cc
We present temporally abstract actor-critic (TAAC), a simple but effective off-policy RL
algorithm that incorporates closed-loop temporal abstraction into the actor-critic framework …

Time discretization-invariant safe action repetition for policy gradient methods

S Park, J Kim, G Kim - Advances in Neural Information …, 2021 - proceedings.neurips.cc
In reinforcement learning, continuous time is often discretized by a time scale $\delta $, to
which the resulting performance is known to be highly sensitive. In this work, we seek to find …

No-regret reinforcement learning in smooth mdps

D Maran, AM Metelli, M Papini, M Restell - arxiv preprint arxiv:2402.03792, 2024 - arxiv.org
Obtaining no-regret guarantees for reinforcement learning (RL) in the case of problems with
continuous state and/or action spaces is still one of the major open challenges in the field …

[PDF][PDF] Configurable environments in reinforcement learning: An overview

AM Metelli - Special Topics in Information Technology, 2022 - library.oapen.org
Reinforcement Learning (RL) has emerged as an effective approach to address a variety of
complex control tasks. In a typical RL problem, an agent interacts with the environment by …

Addressing non-stationarity in fx trading with online model selection of offline rl experts

A Riva, L Bisi, P Liotet, L Sabbioni, E Vittori… - Proceedings of the …, 2022 - dl.acm.org
Reinforcement learning has proven to be successful in obtaining profitable trading policies;
however, the effectiveness of such strategies is strongly conditioned to market stationarity …

Addressing action oscillations through learning policy inertia

C Chen, H Tang, J Hao, W Liu, Z Meng - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Deep reinforcement learning (DRL) algorithms have been demonstrated to be effective on a
wide range of challenging decision making and control tasks. However, these methods …