iDisc: Internal discretization for monocular depth estimation

L Piccinelli, C Sakaridis, F Yu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Monocular depth estimation is fundamental for 3D scene understanding and downstream
applications. However, even under the supervised setup, it is still challenging and ill-posed …

Neural 3D scene reconstruction with the Manhattan-world assumption

H Guo, S Peng, H Lin, Q Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper addresses the challenge of reconstructing 3D indoor scenes from multi-view
images. Many previous works have shown impressive reconstruction results on textured …

HuMoR: 3D human motion model for robust pose estimation

D Rempe, T Birdal, A Hertzmann… - Proceedings of the …, 2021 - openaccess.thecvf.com
We introduce HuMoR: a 3D Human Motion Model for Robust Estimation of temporal pose
and shape. Though substantial progress has been made in estimating 3D human motion …

P3Depth: Monocular depth estimation with a piecewise planarity prior

V Patil, C Sakaridis, A Liniger… - Proceedings of the …, 2022 - openaccess.thecvf.com
Monocular depth estimation is vital for scene understanding and downstream tasks. We
focus on the supervised setup, in which ground-truth depth is available only at training time …

NDDepth: Normal-distance assisted monocular depth estimation

S Shao, Z Pei, W Chen, X Wu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Monocular depth estimation has drawn widespread attention from the vision community due
to its broad applications. In this paper, we propose a novel physics (geometry)-driven deep …

Structured3D: A large photo-realistic dataset for structured 3D modeling

J Zheng, J Zhang, J Li, R Tang, S Gao… - Computer Vision–ECCV …, 2020 - Springer
Recently, there has been growing interest in developing learning-based methods to detect
and utilize salient semi-global or global structures, such as junctions, lines, planes, cuboids …

AffordanceLLM: Grounding affordance from vision language models

S Qian, W Chen, M Bai, X Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Affordance grounding refers to the task of finding the area of an object with which one can
interact. It is a fundamental but challenging task as a successful solution requires the …

Guiding monocular depth estimation using depth-attention volume

L Huynh, P Nguyen-Ha, J Matas, E Rahtu… - Computer Vision–ECCV …, 2020 - Springer
Recovering the scene depth from a single image is an ill-posed problem that requires
additional priors, often referred to as monocular depth cues, to disambiguate different 3D …

Novel view synthesis of dynamic scenes with globally coherent depths from a monocular camera

JS Yoon, K Kim, O Gallo, HS Park… - Proceedings of the …, 2020 - openaccess.thecvf.com
This paper presents a new method to synthesize an image from arbitrary views and times
given a collection of images of a dynamic scene. A key challenge for the novel view …

Map-free visual relocalization: Metric pose relative to a single image

E Arnold, J Wynn, S Vicente… - … on Computer Vision, 2022 - Springer
Can we relocalize in a scene represented by a single reference image? Standard visual
relocalization requires hundreds of images and scale calibration to build a scene-specific …