Unifying flow, stereo and depth estimation

H Xu, J Zhang, J Cai, H Rezatofighi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …

Towards zero-shot scale-aware monocular depth estimation

V Guizilini, I Vasiljevic, D Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Monocular depth estimation is scale-ambiguous, and thus requires scale supervision to
produce metric predictions. Even so, the resulting models will be geometry-specific, with …

Deep digging into the generalization of self-supervised monocular depth estimation

J Bae, S Moon, S Im - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Self-supervised monocular depth estimation has been widely studied recently. Most of the
work has focused on improving performance on benchmark datasets, such as KITTI, but has …

R3D3: Dense 3D reconstruction of dynamic scenes from multiple cameras

A Schmied, T Fischer, M Danelljan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Dense 3D reconstruction and ego-motion estimation are key challenges in autonomous
driving and robotics. Compared to the complex, multi-modal systems deployed today, multi …

Learning to fuse monocular and multi-view cues for multi-frame depth estimation in dynamic scenes

R Li, D Gong, W Yin, H Chen, Y Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multi-frame depth estimation generally achieves high accuracy relying on the multi-view
geometric consistency. When applied in dynamic scenes, e.g., autonomous driving, this …

STS: Surround-view temporal stereo for multi-view 3D detection

Z Wang, C Min, Z Ge, Y Li, Z Li, H Yang… - arXiv preprint arXiv …, 2022 - arxiv.org
Learning accurate depth is essential to multi-view 3D object detection. Recent approaches
mainly learn depth from monocular images, which confront inherent difficulties due to the ill …

Learning temporally consistent video depth from video diffusion priors

J Shao, Y Yang, H Zhou, Y Zhang, Y Shen… - arXiv preprint arXiv …, 2024 - arxiv.org
This work addresses the challenge of video depth estimation, which expects not only
per-frame accuracy but, more importantly, cross-frame consistency. Instead of directly …

ProMotion: Prototypes as motion learners

Y Lu, D Liu, Q Wang, C Han, Y Cui… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this work, we introduce ProMotion, a unified prototypical transformer-based framework
engineered to model fundamental motion tasks. ProMotion offers a range of compelling …

DualRefine: Self-supervised depth and pose estimation through iterative epipolar sampling and refinement toward equilibrium

A Bangunharcana, A Magd… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Self-supervised multi-frame depth estimation achieves high accuracy by computing
matching costs of pixel correspondences between adjacent frames, injecting geometric …

CVRecon: Rethinking 3D geometric feature learning for neural reconstruction

Z Feng, L Yang, P Guo, B Li - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Recent advances in neural reconstruction using posed image sequences have made
remarkable progress. However, due to the lack of depth information, existing volumetric …