Google Académico

Review and analysis of synthetic dataset generation methods and techniques for application in computer vision

G Paulin, M Ivasic‐Kos - Artificial intelligence review, 2023 - Springer

Synthetic datasets, for which we propose the term synthsets, are not a novelty but have
become a necessity. Although they have been used in computer vision since 1989, hel** …

Guardar Citar Citado por 61 Artículos relacionados Las 6 versiones

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Depth anything: Unleashing the power of large-scale unlabeled data

L Yang, B Kang, Z Huang, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …

Guardar Citar Citado por 586 Artículos relacionados Las 6 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] google.com

Recent advances in vision-based indoor navigation: A systematic literature review

D Khan, Z Cheng, H Uchiyama, S Ali, M Asshad… - Computers & …, 2022 - Elsevier

Indoor navigation has remained an active research area for the last decade. Unlike outdoor
environments, indoor environments have additional challenges, such as weak signals, low …

Guardar Citar Citado por 48 Artículos relacionados Las 2 versiones

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Zoedepth: Zero-shot transfer by combining relative and metric depth

SF Bhat, R Birkl, D Wofk, P Wonka, M Müller - arxiv preprint arxiv …, 2023 - arxiv.org

This paper tackles the problem of depth estimation from a single image. Existing work either
focuses on generalization performance disregarding metric scale, ie relative depth …

Guardar Citar Citado por 408 Artículos relacionados Las 5 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Vision transformers for dense prediction

R Ranftl, A Bochkovskiy… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

We introduce dense prediction transformers, an architecture that leverages vision
transformers in place of convolutional networks as a backbone for dense prediction tasks …

Guardar Citar Citado por 2044 Artículos relacionados Las 9 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Battle of the backbones: A large-scale comparison of pretrained models across computer vision tasks

M Goldblum, H Souri, R Ni, M Shu… - Advances in …, 2024 - proceedings.neurips.cc

Neural network based computer vision systems are typically built on a backbone, a
pretrained or randomly initialized feature extractor. Several years ago, the default option was …

Guardar Citar Citado por 63 Artículos relacionados Las 5 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

AutoML: A survey of the state-of-the-art

X He, K Zhao, X Chu - Knowledge-based systems, 2021 - Elsevier

Deep learning (DL) techniques have obtained remarkable achievements on various tasks,
such as image recognition, object detection, and language modeling. However, building a …

Guardar Citar Citado por 1951 Artículos relacionados Las 8 versiones

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Hypersim: A photorealistic synthetic dataset for holistic indoor scene understanding

M Roberts, J Ramapuram, A Ranjan… - Proceedings of the …, 2021 - openaccess.thecvf.com

For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-
pixel ground truth labels from real images. We address this challenge by introducing …

Guardar Citar Citado por 343 Artículos relacionados Las 6 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Midas v3. 1--a model zoo for robust monocular relative depth estimation

R Birkl, D Wofk, M Müller - arxiv preprint arxiv:2307.14460, 2023 - arxiv.org

We release MiDaS v3. 1 for monocular depth estimation, offering a variety of new models
based on different encoder backbones. This release is motivated by the success of …

Guardar Citar Citado por 127 Artículos relacionados Las 2 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Bilateral grid learning for stereo matching networks

B Xu, Y Xu, X Yang, W Jia… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Real-time performance of stereo matching networks is important for many applications, such
as automatic driving, robot navigation and augmented reality (AR). Although significant …

Guardar Citar Citado por 108 Artículos relacionados Las 5 versiones Versión en HTML

Crear alerta

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

Irs: A large naturalistic indoor robotics stereo dataset to train deep models for disparity...

Review and analysis of synthetic dataset generation methods and techniques for application in computer vision

Depth anything: Unleashing the power of large-scale unlabeled data

Recent advances in vision-based indoor navigation: A systematic literature review

Zoedepth: Zero-shot transfer by combining relative and metric depth

Vision transformers for dense prediction

Battle of the backbones: A large-scale comparison of pretrained models across computer vision tasks

AutoML: A survey of the state-of-the-art

Hypersim: A photorealistic synthetic dataset for holistic indoor scene understanding

Midas v3. 1--a model zoo for robust monocular relative depth estimation

Bilateral grid learning for stereo matching networks