Graphbev: Towards robust bev feature alignment for multi-modal 3d object detection

Z Song, L Yang, S Xu, L Liu, D Xu, C Jia, F Jia… - … on Computer Vision, 2024 - Springer
Integrating LiDAR and camera information into Bird's-Eye-View (BEV) representation has
emerged as a crucial aspect of 3D object detection in autonomous driving. However …

Deep learning for 3D object recognition: A survey

AAM Muzahid, H Han, Y Zhang, D Li, Y Zhang… - Neurocomputing, 2024 - Elsevier
With the growing availability of extensive 3D datasets and the rapid progress in
computational power, deep learning (DL) has emerged as a highly promising approach for …

Dense object detection methods in RAW UAV imagery based on YOLOv8

Z Wu, X Wang, M Jia, M Liu, C Sun, C Wu, J Wang - Scientific reports, 2024 - nature.com
Accurate, fast and lightweight dense target detection methods are highly important for
precision agriculture. To detect dense apricot flowers using drones, we propose an …

Sparsedet: a simple and effective framework for fully sparse lidar-based 3D object detection

L Liu, Z Song, Q **a, F Jia, C Jia, L Yang… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
LiDAR-based sparse 3-D object detection plays a crucial role in autonomous driving
applications due to its computational efficiency advantages. Existing methods either use the …

Survey of Deep Learning-Based Methods for FMCW Radar Odometry and Ego-Localization

M Brune, T Meisen, A Pomp - Applied Sciences, 2024 - mdpi.com
This paper provides an in-depth review of deep learning techniques to address the
challenges of odometry and global ego-localization using frequency modulated continuous …

CoreNet: Conflict Resolution Network for point-pixel misalignment and sub-task suppression of 3D LiDAR-camera object detection

Y Li, Y Yang, Z Lei - Information Fusion, 2025 - Elsevier
Fusing multi-modality inputs from different sensors is an effective way to improve the
performance of 3D object detection. However, current methods overlook two important …

SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection

S Dong, W **e, D Yang, J Tian, Y Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Multimodal object detection leverages diverse modal information to enhance the accuracy
and robustness of detectors. Due to its ability to capture long-range dependencies, the …

[HTML][HTML] Bi-Att3DDet: Attention-Based Bi-Directional Fusion for Multi-Modal 3D Object Detection

X Gao, Y Zhao, Y Wang, J Shang, C Zhang, G Wu - Sensors, 2025 - mdpi.com
Currently, multi-modal 3D object detection methods have become a key area of research in
the field of autonomous driving. Fusion is an essential factor affecting performance in multi …

Ultra-FastNet: an end-to-end learnable network for multi-person posture prediction

T Peng, Y Luo, Z Ou, J Du, G Lin - The Journal of Supercomputing, 2024 - Springer
At present, the top-down approach requires the introduction of pedestrian detection
algorithms in multi-person pose estimation. In this paper, we propose an end-to-end …

Stacked Multi-Head Cross Attention for Image Recognition

S Liu, X Du, T Wu - Proceedings of the International Conference on …, 2024 - dl.acm.org
Long-range dependency plays a critical role in extracting intricate image features
particularly in tasks involving image recognition. In previous study, the significance of long …