NTIRE 2023 challenge on image super-resolution (x4): Methods and results

Y Zhang, K Zhang, Z Chen, Y Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper reviews the NTIRE 2023 challenge on image super-resolution (x4), focusing on
the proposed solutions and results. The task of image super-resolution (SR) is to generate a …

Methods and datasets on semantic segmentation for Unmanned Aerial Vehicle remote sensing images: A review

J Cheng, C Deng, Y Su, Z An, Q Wang - ISPRS Journal of Photogrammetry …, 2024 - Elsevier
Abstract Unmanned Aerial Vehicle (UAV) has seen a dramatic rise in popularity for remote-
sensing image acquisition and analysis in recent years. It has brought promising results in …

Depth anything: Unleashing the power of large-scale unlabeled data

L Yang, B Kang, Z Huang, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …

Repurposing diffusion-based image generators for monocular depth estimation

B Ke, A Obukhov, S Huang, N Metzger… - Proceedings of the …, 2024 - openaccess.thecvf.com
Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth
from a single image is geometrically ill-posed and requires scene understanding so it is not …

Dust3r: Geometric 3d vision made easy

S Wang, V Leroy, Y Cabon… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multi-view stereo reconstruction (MVS) in the wild requires to first estimate the camera
intrinsic and extrinsic parameters. These are usually tedious and cumbersome to obtain yet …

Surroundocc: Multi-camera 3d occupancy prediction for autonomous driving

Y Wei, L Zhao, W Zheng, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D scene understanding plays a vital role in vision-based autonomous driving.
While most existing methods focus on 3D object detection, they have difficulty describing …

Zoedepth: Zero-shot transfer by combining relative and metric depth

SF Bhat, R Birkl, D Wofk, P Wonka, M Müller - arxiv preprint arxiv …, 2023 - arxiv.org
This paper tackles the problem of depth estimation from a single image. Existing work either
focuses on generalization performance disregarding metric scale, ie relative depth …

Voxformer: Sparse voxel transformer for camera-based 3d semantic scene completion

Y Li, Z Yu, C Choy, C **ao, JM Alvarez… - Proceedings of the …, 2023 - openaccess.thecvf.com
Humans can easily imagine the complete 3D geometry of occluded objects and scenes. This
appealing ability is vital for recognition and understanding. To enable such capability in AI …

UniDepth: Universal monocular metric depth estimation

L Piccinelli, YH Yang, C Sakaridis… - Proceedings of the …, 2024 - openaccess.thecvf.com
Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks
in 3D perception and modeling. However the remarkable accuracy of recent MMDE methods …

Unleashing text-to-image diffusion models for visual perception

W Zhao, Y Rao, Z Liu, B Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Diffusion models (DMs) have become the new trend of generative models and have
demonstrated a powerful ability of conditional synthesis. Among those, text-to-image …