Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Recent advances and perspectives in deep learning techniques for 3D point cloud data processing

Z Ding, Y Sun, S Xu, Y Pan, Y Peng, Z Mao - Robotics, 2023 - mdpi.com
In recent years, deep learning techniques for processing 3D point cloud data have seen
significant advancements, given their unique ability to extract relevant features and handle …

R2former: Unified retrieval and reranking transformer for place recognition

S Zhu, L Yang, C Chen, M Shah… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Visual Place Recognition (VPR) estimates the location of query images by matching
them with images in a reference database. Conventional methods generally adopt …

Back to the feature: Learning robust camera localization from pixels to pose

PE Sarlin, A Unagar, M Larsson… - Proceedings of the …, 2021 - openaccess.thecvf.com
Camera pose estimation in known scenes is a 3D geometry task recently tackled by multiple
learning algorithms. Many regress precise geometric quantities, like poses or 3D points …

D2-net: A trainable cnn for joint description and detection of local features

M Dusmanu, I Rocco, T Pajdla… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this work we address the problem of finding reliable pixel-level correspondences under
difficult imaging conditions. We propose an approach where a single convolutional neural …

Pixel-perfect structure-from-motion with featuremetric refinement

P Lindenberger, PE Sarlin… - Proceedings of the …, 2021 - openaccess.thecvf.com
Finding local features that are repeatable across multiple views is a cornerstone of sparse
3D reconstruction. The classical image matching paradigm detects keypoints per-image …

Megadepth: Learning single-view depth prediction from internet photos

Z Li, N Snavely - Proceedings of the IEEE conference on …, 2018 - openaccess.thecvf.com
Single-view depth prediction is a fundamental problem in computer vision. Recently, deep
learning methods have led to significant progress, but such methods are limited by the …

Google landmarks dataset v2-a large-scale benchmark for instance-level recognition and retrieval

T Weyand, A Araujo, B Cao… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
While image retrieval and instance recognition techniques are progressing rapidly, there is a
need for challenging datasets to accurately measure their performance--while posing novel …

D2-net: A trainable cnn for joint detection and description of local features

M Dusmanu, I Rocco, T Pajdla, M Pollefeys… - arxiv preprint arxiv …, 2019 - arxiv.org
In this work we address the problem of finding reliable pixel-level correspondences under
difficult imaging conditions. We propose an approach where a single convolutional neural …

Understanding the limitations of cnn-based absolute camera pose regression

T Sattler, Q Zhou, M Pollefeys… - Proceedings of the …, 2019 - openaccess.thecvf.com
Visual localization is the task of accurate camera pose estimation in a known scene. It is a
key problem in computer vision and robotics, with applications including self-driving cars …