Emergent correspondence from image diffusion

L Tang, M Jia, Q Wang, CP Phoo… - Advances in Neural …, 2023 - proceedings.neurips.cc
Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …

A tale of two features: Stable Diffusion complements DINO for zero-shot semantic correspondence

J Zhang, C Herrmann, J Hur… - Advances in …, 2023 - proceedings.neurips.cc
Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …

Towards scalable neural representation for diverse videos

B He, X Yang, H Wang, Z Wu, H Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Implicit neural representations (INR) have gained increasing attention in representing 3D
scenes and images, and have been recently applied to encode videos (e.g., NeRV, E-NeRV) …

SD4Match: Learning to prompt Stable Diffusion model for semantic matching

X Li, J Lu, K Han, VA Prisacariu - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
In this paper we address the challenge of matching semantically similar keypoints across
image pairs. Existing research indicates that the intermediate output of the UNet within the …

Telling left from right: Identifying geometry-aware semantic correspondence

J Zhang, C Herrmann, J Hur, E Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com
While pre-trained large-scale vision models have shown significant promise for semantic
correspondence, their features often struggle to grasp the geometry and orientation of …

What is Point Supervision Worth in Video Instance Segmentation?

S Huang, DA Huang, Z Yu, S Lan… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video instance segmentation (VIS) is a challenging vision task that aims to detect, segment,
and track objects in videos. Conventional VIS methods rely on densely annotated object …

ASIC: Aligning sparse in-the-wild image collections

K Gupta, V Jampani, C Esteves… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a method for joint alignment of sparse in-the-wild image collections of an object
category. Most prior works assume either ground-truth keypoint annotations or a large …

Improving semantic correspondence with viewpoint-guided spherical maps

O Mariotti, O Mac Aodha… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Recent self-supervised models produce visual features that are not only effective at
encoding image-level but also pixel-level semantics. They have been reported to obtain …

Efficient Semantic Matching with Hypercolumn Correlation

S Kim, J Min, M Cho - Proceedings of the IEEE/CVF Winter …, 2024 - openaccess.thecvf.com
Recent studies show that leveraging the match-wise relationships within the 4D correlation
map yields significant improvements in establishing semantic correspondences, but at the …

UVIS: Unsupervised Video Instance Segmentation

S Huang, S Suri, K Gupta… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video instance segmentation requires classifying, segmenting, and tracking every object
across video frames. Unlike existing approaches that rely on masks, boxes, or category labels …