Review and analysis of synthetic dataset generation methods and techniques for application in computer vision

G Paulin, M Ivasic‐Kos - Artificial intelligence review, 2023 - Springer
Synthetic datasets, for which we propose the term synthsets, are not a novelty but have
become a necessity. Although they have been used in computer vision since 1989, hel** …

Depth anything: Unleashing the power of large-scale unlabeled data

L Yang, B Kang, Z Huang, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …

Recent advances in vision-based indoor navigation: A systematic literature review

D Khan, Z Cheng, H Uchiyama, S Ali, M Asshad… - Computers & …, 2022 - Elsevier
Indoor navigation has remained an active research area for the last decade. Unlike outdoor
environments, indoor environments have additional challenges, such as weak signals, low …

Zoedepth: Zero-shot transfer by combining relative and metric depth

SF Bhat, R Birkl, D Wofk, P Wonka, M Müller - arxiv preprint arxiv …, 2023 - arxiv.org
This paper tackles the problem of depth estimation from a single image. Existing work either
focuses on generalization performance disregarding metric scale, ie relative depth …

Vision transformers for dense prediction

R Ranftl, A Bochkovskiy… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We introduce dense prediction transformers, an architecture that leverages vision
transformers in place of convolutional networks as a backbone for dense prediction tasks …

Battle of the backbones: A large-scale comparison of pretrained models across computer vision tasks

M Goldblum, H Souri, R Ni, M Shu… - Advances in …, 2024 - proceedings.neurips.cc
Neural network based computer vision systems are typically built on a backbone, a
pretrained or randomly initialized feature extractor. Several years ago, the default option was …

AutoML: A survey of the state-of-the-art

X He, K Zhao, X Chu - Knowledge-based systems, 2021 - Elsevier
Deep learning (DL) techniques have obtained remarkable achievements on various tasks,
such as image recognition, object detection, and language modeling. However, building a …

Hypersim: A photorealistic synthetic dataset for holistic indoor scene understanding

M Roberts, J Ramapuram, A Ranjan… - Proceedings of the …, 2021 - openaccess.thecvf.com
For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-
pixel ground truth labels from real images. We address this challenge by introducing …

Midas v3. 1--a model zoo for robust monocular relative depth estimation

R Birkl, D Wofk, M Müller - arxiv preprint arxiv:2307.14460, 2023 - arxiv.org
We release MiDaS v3. 1 for monocular depth estimation, offering a variety of new models
based on different encoder backbones. This release is motivated by the success of …

Bilateral grid learning for stereo matching networks

B Xu, Y Xu, X Yang, W Jia… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Real-time performance of stereo matching networks is important for many applications, such
as automatic driving, robot navigation and augmented reality (AR). Although significant …