Mars: An instance-aware, modular and realistic simulator for autonomous driving

Z Wu, T Liu, L Luo, Z Zhong, J Chen, H **ao… - … Conference on Artificial …, 2023 - Springer
Nowadays, autonomous cars can drive smoothly in ordinary cases, and it is widely
recognized that realistic sensor simulation will play a critical role in solving remaining corner …

Explore until confident: Efficient exploration for embodied question answering

AZ Ren, J Clark, A Dixit, M Itkina, A Majumdar… - arxiv preprint arxiv …, 2024 - arxiv.org
We consider the problem of Embodied Question Answering (EQA), which refers to settings
where an embodied agent such as a robot needs to actively explore an environment to …

Pad: A dataset and benchmark for pose-agnostic anomaly detection

Q Zhou, W Li, L Jiang, G Wang… - Advances in …, 2023 - proceedings.neurips.cc
Object anomaly detection is an important problem in the field of machine vision and has
seen remarkable progress recently. However, two significant challenges hinder its research …

P-mapnet: Far-seeing map generator enhanced by both sdmap and hdmap priors

Z Jiang, Z Zhu, P Li, H Gao, T Yuan… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
Autonomous vehicles are gradually entering city roads today, with the help of high-definition
maps (HDMaps). However, the reliance on HDMaps prevents autonomous vehicles from …

Training-free model merging for multi-target domain adaptation

W Li, H Gao, M Gao, B Tian, R Zhi, H Zhao - European Conference on …, 2024 - Springer
In this paper, we study multi-target domain adaptation of scene understanding models.
While previous methods achieved commendable results through inter-domain consistency …

Fairdiff: Fair segmentation with point-image diffusion

W Li, H Xu, G Zhang, H Gao, M Gao, M Wang… - … Conference on Medical …, 2024 - Springer
Fairness is an important topic for medical image analysis, driven by the challenge of
unbalanced training data among diverse target groups and the societal demand for …

Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs

J Chen, X Li, X Ye, C Li, Z Fan, H Zhao - arxiv preprint arxiv:2404.04363, 2024 - arxiv.org
With the success of 2D diffusion models, 2D AIGC content has already transformed our lives.
Recently, this success has been extended to 3D AIGC, with state-of-the-art methods …

Hint-ad: Holistically aligned interpretability in end-to-end autonomous driving

K Ding, B Chen, Y Su, H Gao, B **, C Sima… - arxiv preprint arxiv …, 2024 - arxiv.org
End-to-end architectures in autonomous driving (AD) face a significant challenge in
interpretability, impeding human-AI trust. Human-friendly natural language has been …

Ensemble uncertainty guided road scene anomaly detection: A simple meta-learning approach

Y Liu, X Wei, P Lasang, S Pranata… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
A simple method is presented for road scene anomaly detection, which is particularly
important in autonomous driving. Without the prior knowledge about the appearance and …

UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving

D Bogdoll, N Ollick, T Joseph, S Pavlitska… - arxiv preprint arxiv …, 2024 - arxiv.org
Dealing with atypical traffic scenarios remains a challenging task in autonomous driving.
However, most anomaly detection approaches cannot be trained on raw sensor data but …