Robustness-aware 3d object detection in autonomous driving: A review and outlook

Z Song, L Liu, F Jia, Y Luo, C Jia… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
In the realm of modern autonomous driving, the perception system is indispensable for
accurately assessing the state of the surrounding environment, thereby enabling informed …

Grid-centric traffic scenario perception for autonomous driving: A comprehensive review

Y Shi, K Jiang, J Li, Z Qian, J Wen… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
The grid-centric perception is a crucial field for mobile robot perception and navigation.
Nonetheless, the grid-centric perception is less prevalent than object-centric perception as …

Fb-bev: Bev representation from forward-backward view transformations

Z Li, Z Yu, W Wang, A Anandkumar… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract View Transformation Module (VTM), where transformations happen between multi-
view image features and Bird-Eye-View (BEV) representation, is a crucial step in camera …

Exploring recurrent long-term temporal fusion for multi-view 3d perception

C Han, J Yang, J Sun, Z Ge, R Dong… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
Long-term temporal fusion is a crucial but often overlooked technique in camera-based
Bird's-Eye-View (BEV) 3D perception. Existing methods are mostly in a parallel manner …

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

Y Li, B Huang, Z Chen, Y Cui, F Liang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Recently, perception task based on Bird's-Eye View (BEV) representation has drawn more
and more attention, and BEV representation is promising as the foundation for next …

Uniworld: Autonomous driving pre-training via world models

C Min, D Zhao, L **ao, Y Nie, B Dai - arxiv preprint arxiv:2308.07234, 2023 - arxiv.org
In this paper, we draw inspiration from Alberto Elfes' pioneering work in 1989, where he
introduced the concept of the occupancy grid as World Models for robots. We imbue the …

V2VFormer: Multi-Modal Vehicle-to-Vehicle Cooperative Perception via Global-Local Transformer

H Yin, D Tian, C Lin, X Duan, J Zhou… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Multi-vehicle cooperative perception has recently emerged for facilitating long-range and
large-scale perception ability of connected automated vehicles (CAVs). Nonetheless …

PointBeV: A Sparse Approach for BeV Predictions

L Chambon, E Zablocki, M Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Bird's-eye View (BeV) representations have emerged as the de-facto shared space
in driving applications offering a unified space for sensor data fusion and supporting various …

Inversematrixvt3d: An efficient projection matrix-based approach for 3d occupancy prediction

Z Ming, JS Berrio, M Shan, S Worrall - arxiv preprint arxiv:2401.12422, 2024 - arxiv.org
This paper introduces InverseMatrixVT3D, an efficient method for transforming multi-view
image features into 3D feature volumes for 3D semantic occupancy prediction. Existing …

RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies

X Chu, J Deng, G You, Y Duan, Y Li… - Proceedings of the 32nd …, 2024 - dl.acm.org
The recent advances in query-based multi-camera 3D object detection are featured by
initializing object queries in the 3D space, and then sampling features from perspective-view …