Reinforcement learning approaches in social robotics

N Akalin, A Loutfi - Sensors, 2021 - mdpi.com
This article surveys reinforcement learning approaches in social robotics. Reinforcement
learning is a framework for decision-making problems in which an agent interacts through …

Goal-conditioned reinforcement learning: Problems and solutions

M Liu, M Zhu, W Zhang - arxiv preprint arxiv:2201.08299, 2022 - arxiv.org
Goal-conditioned reinforcement learning (GCRL), related to a set of complex RL problems,
trains an agent to achieve different goals under particular scenarios. Compared to the …

Exploration via hindsight goal generation

Z Ren, K Dong, Y Zhou, Q Liu… - Advances in Neural …, 2019 - proceedings.neurips.cc
Goal-oriented reinforcement learning has recently been a practical framework for robotic
manipulation tasks, in which an agent is required to reach a certain goal defined by a …

[HTML][HTML] Imaginary filtered hindsight experience replay for UAV tracking dynamic targets in large-scale unknown environments

HU Zijian, GAO **aoguang, WAN Kaifang… - Chinese Journal of …, 2023 - Elsevier
As an advanced combat weapon, Unmanned Aerial Vehicles (UAVs) have been widely
used in military wars. In this paper, we formulated the Autonomous Navigation Control …

Multi-Robot Environmental Coverage With a Two-Stage Coordination Strategy via Deep Reinforcement Learning

L Zhu, J Cheng, H Zhang, W Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Multi-robot environmental coverage can be widely used in many applications like search
and rescue. However, it is challenging to coordinate the robot team for high coverage …

Relay Hindsight Experience Replay: Self-guided continual reinforcement learning for sequential object manipulation tasks with sparse rewards

Y Luo, Y Wang, K Dong, Q Zhang, E Cheng, Z Sun… - Neurocomputing, 2023 - Elsevier
Learning with sparse rewards remains a challenging problem in reinforcement learning
(RL). In particular, for sequential object manipulation tasks, the RL agent generally only …

Batch prioritization in multigoal reinforcement learning

LF Vecchietti, T Kim, K Choi, J Hong, D Har - IEEE Access, 2020 - ieeexplore.ieee.org
In multigoal reinforcement learning, an agent interacts with an environment and learns to
achieve multiple goals. The goal-conditioned policy is trained to effectively generalize its …

Building open-ended embodied agent via language-policy bidirectional adaptation

S Zhai, J Wang, T Zhang, F Huang, Q Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Building embodied agents on integrating Large Language Models (LLMs) and
Reinforcement Learning (RL) have revolutionized human-AI interaction: researchers can …

A controllable agent by subgoals in path planning using goal-conditioned reinforcement learning

GT Lee, K Kim - IEEE Access, 2023 - ieeexplore.ieee.org
The aim of path planning is to search for a path from the starting point to the goal. Numerous
studies, however, have dealt with a single predefined goal. That is, an agent who has …

Optimal bipartite graph matching-based goal selection for policy-based hindsight learning

S Sun, H Zhang, Z Liu, X Chen, X Lan - Neurocomputing, 2024 - Elsevier
The sparse reward problem stands as a significant challenge in the field of reinforcement
learning. Hindsight Experience Replay (HER) addresses this by goal relabeling, allowing …