SIFT meets CNN: A decade survey of instance retrieval

L Zheng, Y Yang, Q Tian - IEEE transactions on pattern …, 2017 - ieeexplore.ieee.org
In the early days, content-based image retrieval (CBIR) was studied with global features.
Since 2003, image retrieval based on local descriptors (de facto SIFT) has been extensively …

Recent advance in content-based image retrieval: A literature survey

W Zhou, H Li, Q Tian - arxiv preprint arxiv:1706.06064, 2017 - arxiv.org
The explosive increase and ubiquitous accessibility of visual data on the Web have led to
the prosperity of research activity in image search or retrieval. With the ignorance of visual …

Task residual for tuning vision-language models

T Yu, Z Lu, X **, Z Chen… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Large-scale vision-language models (VLMs) pre-trained on billion-level data have learned
general visual representations and broad visual concepts. In principle, the well-learned …

Visual translation embedding network for visual relation detection

H Zhang, Z Kyaw, SF Chang… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Visual relations, such as" person ride bike" and" bike next to car", offer a comprehensive
scene understanding of an image, and have already shown their great utility in connecting …

Interpreting clip with sparse linear concept embeddings (splice)

U Bhalla, A Oesterling, S Srinivas… - Advances in …, 2025 - proceedings.neurips.cc
CLIP embeddings have demonstrated remarkable performance across a wide range of
multimodal applications. However, these high-dimensional, dense vector representations …

Image retrieval using scene graphs

J Johnson, R Krishna, M Stark, LJ Li… - Proceedings of the …, 2015 - openaccess.thecvf.com
This paper develops a novel framework for semantic image retrieval based on the notion of
a scene graph. Our scene graphs represent objects (" man"," boat"), attributes of objects (" …

Decaf: A deep convolutional activation feature for generic visual recognition

J Donahue, Y Jia, O Vinyals… - International …, 2014 - proceedings.mlr.press
We evaluate whether features extracted from the activation of a deep convolutional network
trained in a fully supervised fashion on a large, fixed set of object recognition tasks can be re …

Zero-shot visual recognition using semantics-preserving adversarial embedding networks

L Chen, H Zhang, J **ao, W Liu… - Proceedings of the …, 2018 - openaccess.thecvf.com
We propose a novel framework called Semantics-Preserving Adversarial Embedding
Network (SP-AEN) for zero-shot visual recognition (ZSL), where test images and their …

Neural codes for image retrieval

A Babenko, A Slesarev, A Chigorin… - Computer Vision–ECCV …, 2014 - Springer
It has been shown that the activations invoked by an image within the top layers of a large
convolutional neural network provide a high-level descriptor of the visual content of the …

Attribute-based classification for zero-shot visual object categorization

CH Lampert, H Nickisch… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
We study the problem of object recognition for categories for which we have no training
examples, a task also called zero--data or zero-shot learning. This situation has hardly been …