Rethinking robustness assessment: Adversarial attacks on learning-based quadrupedal locomotion controllers

F Shi, C Zhang, T Miki, J Lee, M Hutter… - arxiv preprint arxiv …, 2024 - arxiv.org
Legged locomotion has recently achieved remarkable success with the progress of machine
learning techniques, especially deep reinforcement learning (RL). Controllers employing …

SAM-E: leveraging visual foundation model with sequence imitation for embodied manipulation

J Zhang, C Bai, H He, W **a, Z Wang, B Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
Acquiring a multi-task imitation policy in 3D manipulation poses challenges in terms of
scene understanding and action prediction. Current methods employ both 3D representation …

Contrastive representation for data filtering in cross-domain offline reinforcement learning

X Wen, C Bai, K Xu, X Yu, Y Zhang, X Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Cross-domain offline reinforcement learning leverages source domain data with diverse
transition dynamics to alleviate the data requirement for the target domain. However, simply …

Pessimistic value iteration for multi-task data sharing in offline reinforcement learning

C Bai, L Wang, J Hao, Z Yang, B Zhao, Z Wang, X Li - Artificial Intelligence, 2024 - Elsevier
Abstract Offline Reinforcement Learning (RL) has shown promising results in learning a task-
specific policy from a fixed dataset. However, successful offline RL often relies heavily on the …

Constrained ensemble exploration for unsupervised skill discovery

C Bai, R Yang, Q Zhang, K Xu, Y Chen, T **ao… - arxiv preprint arxiv …, 2024 - arxiv.org
Unsupervised Reinforcement Learning (RL) provides a promising paradigm for learning
useful behaviors via reward-free per-training. Existing methods for unsupervised RL mainly …

COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models

K Liu, Z Tang, D Wang, Z Wang, B Zhao, X Li - arxiv preprint arxiv …, 2024 - arxiv.org
Leveraging the powerful reasoning capabilities of large language models (LLMs), recent
LLM-based robot task planning methods yield promising results. However, they mainly focus …

Robust Locomotion Policy with Adaptive Lipschitz Constraint for Legged Robots

Y Zhang, B Nie, Y Gao - IEEE Robotics and Automation Letters, 2024 - ieeexplore.ieee.org
Deep Reinforcement Learning (DRL) has achieved significant advancements in legged
robot locomotion tasks. However, neural network-based control policies suffer from action …

Humanoid Whole-Body Locomotion on Narrow Terrain via Dynamic Balance and Reinforcement Learning

W **e, C Bai, J Shi, J Yang, Y Ge, W Zhang… - arxiv preprint arxiv …, 2025 - arxiv.org
Humans possess delicate dynamic balance mechanisms that enable them to maintain
stability across diverse terrains and under extreme conditions. However, despite significant …

Moor: Model-based offline policy optimization with a risk dynamics model

X Su, P Li, S Chen - Complex & Intelligent Systems, 2025 - Springer
Offline reinforcement learning (RL) has been widely used in safety-critical domains by
avoiding dangerous and costly online interaction. A significant challenge is addressing …

A Provable Quantile Regression Adapter via Transfer Learning

R Yang, A Zhang, C Bai, X Su, Y Chen - openreview.net
Adapter-tuning strategy is an efficient method in machine learning that introduces
lightweight and sparse trainable parameters into a pretrained model without altering the …