3D object detection for autonomous driving: A comprehensive survey

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

Empowering things with intelligence: a survey of the progress, challenges, and opportunities in artificial intelligence of things

J Zhang, D Tao - IEEE Internet of Things Journal, 2020 - ieeexplore.ieee.org
In the Internet-of-Things (IoT) era, billions of sensors and devices collect and process data
from the environment, transmit them to cloud centers, and receive feedback via the Internet …

Repurposing diffusion-based image generators for monocular depth estimation

B Ke, A Obukhov, S Huang, N Metzger… - Proceedings of the …, 2024 - openaccess.thecvf.com
Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth
from a single image is geometrically ill-posed and requires scene understanding so it is not …

Spatialvlm: Endowing vision-language models with spatial reasoning capabilities

B Chen, Z Xu, S Kirmani, B Ichter… - Proceedings of the …, 2024 - openaccess.thecvf.com
Understanding and reasoning about spatial relationships is crucial for Visual Question
Answering (VQA) and robotics. Vision Language Models (VLMs) have shown impressive …

Zoedepth: Zero-shot transfer by combining relative and metric depth

SF Bhat, R Birkl, D Wofk, P Wonka, M Müller - arxiv preprint arxiv …, 2023 - arxiv.org
This paper tackles the problem of depth estimation from a single image. Existing work either
focuses on generalization performance disregarding metric scale, ie relative depth …

UniDepth: Universal monocular metric depth estimation

L Piccinelli, YH Yang, C Sakaridis… - Proceedings of the …, 2024 - openaccess.thecvf.com
Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks
in 3D perception and modeling. However the remarkable accuracy of recent MMDE methods …

Lite-mono: A lightweight cnn and transformer architecture for self-supervised monocular depth estimation

N Zhang, F Nex, G Vosselman… - Proceedings of the …, 2023 - openaccess.thecvf.com
Self-supervised monocular depth estimation that does not require ground truth for training
has attracted attention in recent years. It is of high interest to design lightweight but effective …

Nope-nerf: Optimising neural radiance field with no pose prior

W Bian, Z Wang, K Li, JW Bian… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Training a Neural Radiance Field (NeRF) without pre-computed camera poses is
challenging. Recent advances in this direction demonstrate the possibility of jointly …

Bevdepth: Acquisition of reliable depth for multi-view 3d object detection

Y Li, Z Ge, G Yu, J Yang, Z Wang, Y Shi… - Proceedings of the AAAI …, 2023 - ojs.aaai.org
In this research, we propose a new 3D object detector with a trustworthy depth estimation,
dubbed BEVDepth, for camera-based Bird's-Eye-View~(BEV) 3D object detection. Our work …

idisc: Internal discretization for monocular depth estimation

L Piccinelli, C Sakaridis, F Yu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Monocular depth estimation is fundamental for 3D scene understanding and downstream
applications. However, even under the supervised setup, it is still challenging and ill-posed …