A comprehensive survey of data augmentation in visual reinforcement learning

G Ma, Z Wang, Z Yuan, X Wang, B Yuan… - arxiv preprint arxiv …, 2022 - arxiv.org
Visual reinforcement learning (RL), which makes decisions directly from high-dimensional
visual inputs, has demonstrated significant potential in various domains. However …

Curiosity-driven and victim-aware adversarial policies

C Gong, Z Yang, Y Bai, J Shi, A Sinha, B Xu… - Proceedings of the 38th …, 2022 - dl.acm.org
Recent years have witnessed great potential in applying Deep Reinforcement Learning
(DRL) in various challenging applications, such as autonomous driving, nuclear fusion …

Revisiting plasticity in visual reinforcement learning: Data, modules and training stages

G Ma, L Li, S Zhang, Z Liu, Z Wang, Y Chen… - arxiv preprint arxiv …, 2023 - arxiv.org
Plasticity, the ability of a neural network to evolve with new data, is crucial for high-
performance and sample-efficient visual reinforcement learning (VRL). Although methods …

Learning better with less: effective augmentation for sample-efficient visual reinforcement learning

G Ma, L Zhang, H Wang, L Li, Z Wang… - Advances in …, 2024 - proceedings.neurips.cc
Data augmentation (DA) is a crucial technique for enhancing the sample efficiency of visual
reinforcement learning (RL) algorithms. Notably, employing simple observation …

Cross-Observability Optimistic-Pessimistic Safe Reinforcement Learning for Interactive Motion Planning With Visual Occlusion

X Hou, M Gan, W Wu, Y Ji, S Zhao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
This study focuses on the motion planning and risk evaluation of unprotected left turns at
occluded intersections for autonomous vehicles. In this paper, we present an interactive …

Risk-Conscious Mutations in Jump-Start Reinforcement Learning for Autonomous Racing Policy

X Hou, M Gan, W Wu, S Zhao, Y Ji… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
This study focuses on trajectory planning and motion control policies in autonomous racing,
which necessitates pushing the capacity boundaries of racing vehicles to achieve maximum …

An Improved Prioritized DDPG Based on Fractional-Order Learning Scheme

QY Fan, M Cai, B Xu - IEEE Transactions on Neural Networks …, 2024 - ieeexplore.ieee.org
Although deep deterministic policy gradient (DDPG) algorithm gets widespread attention as
a result of its powerful functionality and applicability for large-scale continuous control, it …

CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning

C Sun, H Qian, C Miao - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Offline reinforcement learning (RL) aims to learn an effective policy from a pre-collected
dataset. Most existing works are to develop sophisticated learning algorithms, with less …

Curiosity-tuned experience replay for wargaming decision modeling without reward-engineering

L Dong, N Li, G Gong - Simulation Modelling Practice and Theory, 2023 - Elsevier
Reinforcement Learning (RL) has become a promising technique to deal with the tough
decision modeling problem in the wargaming field. However, to deploy current RL …

Simoun: Synergizing Interactive Motion-appearance Understanding for Vision-based Reinforcement Learning

Y Huang, P Peng, Y Zhao, Y Zhai… - 2023 IEEE/CVF …, 2023 - ieeexplore.ieee.org
Efficient motion and appearance modeling are critical for vision-based Reinforcement
Learning (RL). However, existing methods struggle to reconcile motion and appearance …