Rank-DETR for high quality object detection

Y Pu, W Liang, Y Hao, Y Yuan… - Advances in …, 2024 - proceedings.neurips.cc
Modern detection transformers (DETRs) use a set of object queries to predict a list of
bounding boxes, sort them by their classification confidence scores, and select the top …

Efficient diffusion transformer with step-wise dynamic attention mediators

Y Pu, Z **a, J Guo, D Han, Q Li, D Li, Y Yuan… - … on Computer Vision, 2024 - Springer
This paper identifies significant redundancy in the query-key interactions within self-attention
mechanisms of diffusion transformer models, particularly during the early stages of …

Train once, get a family: State-adaptive balances for offline-to-online reinforcement learning

S Wang, Q Yang, J Gao, M Lin… - Advances in …, 2024 - proceedings.neurips.cc
Offline-to-online reinforcement learning (RL) is a training paradigm that combines pre-
training on a pre-collected dataset with fine-tuning in an online environment. However, the …

Understanding, predicting and better resolving Q-value divergence in offline-RL

Y Yue, R Lu, B Kang, S Song… - Advances in Neural …, 2024 - proceedings.neurips.cc
The divergence of the Q-value estimation has been a prominent issue offline reinforcement
learning (offline RL), where the agent has no access to real dynamics. Traditional beliefs …

ACL-QL: Adaptive Conservative Level in -Learning for Offline Reinforcement Learning

K Wu, Y Zhao, Z Xu, Z Che, C Yin… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
Offline reinforcement learning (RL), which operates solely on static datasets without further
interactions with the environment, provides an appealing alternative to learning a safe and …

Domain: Mildly conservative model-based offline reinforcement learning

XY Liu, XH Zhou, MJ Gui, XL **e, SQ Liu… - arxiv preprint arxiv …, 2023 - arxiv.org
Model-based reinforcement learning (RL), which learns environment model from offline
dataset and generates more out-of-distribution model data, has become an effective …

Decoupled Prioritized Resampling for Offline RL

Y Yue, B Kang, X Ma, Q Yang, G Huang… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
Offline reinforcement learning (RL) is challenged by the distributional shift problem. To
tackle this issue, existing works mainly focus on designing sophisticated policy constraints …