Holistic network virtualization and pervasive network intelligence for 6G

X Shen, J Gao, W Wu, M Li, C Zhou… - … Surveys & Tutorials, 2021 - ieeexplore.ieee.org
In this tutorial paper, we look into the evolution and prospect of network architecture and
propose a novel conceptual architecture for the 6th generation (6G) networks. The proposed …

A review of visual SLAM methods for autonomous driving vehicles

J Cheng, L Zhang, Q Chen, X Hu, J Cai - Engineering Applications of …, 2022 - Elsevier
Autonomous driving vehicles require both a precise localization and map** solution in
different driving environment. In this context, Simultaneous Localization and Map** …

Bevformer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision

C Yang, Y Chen, H Tian, C Tao, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel bird's-eye-view (BEV) detector with perspective supervision, which
converges faster and better suits modern image backbones. Existing state-of-the-art BEV …

Bevfusion: Multi-task multi-sensor fusion with unified bird's-eye view representation

Z Liu, H Tang, A Amini, X Yang, H Mao… - … on robotics and …, 2023 - ieeexplore.ieee.org
Multi-sensor fusion is essential for an accurate and reliable autonomous driving system.
Recent approaches are based on point-level fusion: augmenting the LiDAR point cloud with …

Point transformer v2: Grouped vector attention and partition-based pooling

X Wu, Y Lao, L Jiang, X Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc
As a pioneering work exploring transformer architecture for 3D point cloud understanding,
Point Transformer achieves impressive results on multiple highly competitive benchmarks. In …

Bevformer: learning bird's-eye-view representation from lidar-camera via spatiotemporal transformers

Z Li, W Wang, H Li, E **e, C Sima, T Lu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Multi-modality fusion strategy is currently the de-facto most competitive solution for 3D
perception tasks. In this work, we present a new framework termed BEVFormer, which learns …

Transfusion: Robust lidar-camera fusion for 3d object detection with transformers

X Bai, Z Hu, X Zhu, Q Huang, Y Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
LiDAR and camera are two important sensors for 3D object detection in autonomous driving.
Despite the increasing popularity of sensor fusion in this field, the robustness against inferior …

Virtual sparse convolution for multimodal 3d object detection

H Wu, C Wen, S Shi, X Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Recently, virtual/pseudo-point-based 3D object detection that seamlessly fuses
RGB images and LiDAR data by depth completion has gained great attention. However …

Bevfusion: A simple and robust lidar-camera fusion framework

T Liang, H **e, K Yu, Z **a, Z Lin… - Advances in …, 2022 - proceedings.neurips.cc
Fusing the camera and LiDAR information has become a de-facto standard for 3D object
detection tasks. Current methods rely on point clouds from the LiDAR sensor as queries to …

Point Transformer V3: Simpler Faster Stronger

X Wu, L Jiang, PS Wang, Z Liu, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper is not motivated to seek innovation within the attention mechanism. Instead it
focuses on overcoming the existing trade-offs between accuracy and efficiency within the …