Review and analysis of synthetic dataset generation methods and techniques for application in computer vision
Synthetic datasets, for which we propose the term synthsets, are not a novelty but have
become a necessity. Although they have been used in computer vision since 1989, hel** …
become a necessity. Although they have been used in computer vision since 1989, hel** …
Depth anything: Unleashing the power of large-scale unlabeled data
Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …
Recent advances in vision-based indoor navigation: A systematic literature review
Indoor navigation has remained an active research area for the last decade. Unlike outdoor
environments, indoor environments have additional challenges, such as weak signals, low …
environments, indoor environments have additional challenges, such as weak signals, low …
Zoedepth: Zero-shot transfer by combining relative and metric depth
This paper tackles the problem of depth estimation from a single image. Existing work either
focuses on generalization performance disregarding metric scale, ie relative depth …
focuses on generalization performance disregarding metric scale, ie relative depth …
Vision transformers for dense prediction
We introduce dense prediction transformers, an architecture that leverages vision
transformers in place of convolutional networks as a backbone for dense prediction tasks …
transformers in place of convolutional networks as a backbone for dense prediction tasks …
Battle of the backbones: A large-scale comparison of pretrained models across computer vision tasks
Neural network based computer vision systems are typically built on a backbone, a
pretrained or randomly initialized feature extractor. Several years ago, the default option was …
pretrained or randomly initialized feature extractor. Several years ago, the default option was …
AutoML: A survey of the state-of-the-art
Deep learning (DL) techniques have obtained remarkable achievements on various tasks,
such as image recognition, object detection, and language modeling. However, building a …
such as image recognition, object detection, and language modeling. However, building a …
Hypersim: A photorealistic synthetic dataset for holistic indoor scene understanding
For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-
pixel ground truth labels from real images. We address this challenge by introducing …
pixel ground truth labels from real images. We address this challenge by introducing …
Midas v3. 1--a model zoo for robust monocular relative depth estimation
We release MiDaS v3. 1 for monocular depth estimation, offering a variety of new models
based on different encoder backbones. This release is motivated by the success of …
based on different encoder backbones. This release is motivated by the success of …
Bilateral grid learning for stereo matching networks
B Xu, Y Xu, X Yang, W Jia… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Real-time performance of stereo matching networks is important for many applications, such
as automatic driving, robot navigation and augmented reality (AR). Although significant …
as automatic driving, robot navigation and augmented reality (AR). Although significant …