A review of multimodal image matching: Methods and applications
Multimodal image matching, which refers to identifying and then corresponding the same or
similar structure/content from two or more images that are of significant modalities or …
similar structure/content from two or more images that are of significant modalities or …
Deep learning for instance retrieval: A survey
In recent years a vast amount of visual content has been generated and shared from many
fields, such as social media platforms, medical imaging, and robotics. This abundance of …
fields, such as social media platforms, medical imaging, and robotics. This abundance of …
Image matching from handcrafted to deep features: A survey
As a fundamental and critical task in various visual applications, image matching can identify
then correspond the same or similar structure/content from two or more images. Over the …
then correspond the same or similar structure/content from two or more images. Over the …
Tracking everything everywhere all at once
We present a new test-time optimization method for estimating dense and long-range motion
from a video sequence. Prior optical flow or particle video tracking algorithms typically …
from a video sequence. Prior optical flow or particle video tracking algorithms typically …
Patch-netvlad: Multi-scale fusion of locally-global descriptors for place recognition
Abstract Visual Place Recognition is a challenging task for robotics and autonomous
systems, which must deal with the twin problems of appearance and viewpoint change in an …
systems, which must deal with the twin problems of appearance and viewpoint change in an …
Aspanformer: Detector-free image matching with adaptive span transformer
Generating robust and reliable correspondences across images is a fundamental task for a
diversity of applications. To capture context at both global and local granularity, we propose …
diversity of applications. To capture context at both global and local granularity, we propose …
A tale of two features: Stable diffusion complements dino for zero-shot semantic correspondence
Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …
high-quality images. As a result, numerous approaches have explored the ability of diffusion …
Grounding image matching in 3d with mast3r
Image Matching is a core component of all best-performing algorithms and pipelines in 3D
vision. Yet despite matching being fundamentally a 3D problem, intrinsically linked to …
vision. Yet despite matching being fundamentally a 3D problem, intrinsically linked to …
Learning target candidate association to keep track of what not to track
The presence of objects that are confusingly similar to the tracked target, poses a
fundamental challenge in appearance-based visual tracking. Such distractor objects are …
fundamental challenge in appearance-based visual tracking. Such distractor objects are …
Cotr: Correspondence transformer for matching across images
We propose a novel framework for finding correspondences in images based on a deep
neural network that, given two images and a query point in one of them, finds its …
neural network that, given two images and a query point in one of them, finds its …