SIFT meets CNN: A decade survey of instance retrieval

L Zheng, Y Yang, Q Tian - IEEE transactions on pattern …, 2017 - ieeexplore.ieee.org
In the early days, content-based image retrieval (CBIR) was studied with global features.
Since 2003, image retrieval based on local descriptors (de facto SIFT) has been extensively …

Deep learning for instance retrieval: A survey

W Chen, Y Liu, W Wang, EM Bakker… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
In recent years a vast amount of visual content has been generated and shared from many
fields, such as social media platforms, medical imaging, and robotics. This abundance of …

YOLOv7-RAR for urban vehicle detection

Y Zhang, Y Sun, Z Wang, Y Jiang - Sensors, 2023 - mdpi.com
Aiming at the problems of high missed detection rates of the YOLOv7 algorithm for vehicle
detection on urban roads, weak perception of small targets in perspective, and insufficient …

A fast and accurate one-stage approach to visual grounding

Z Yang, B Gong, L Wang, W Huang… - Proceedings of the …, 2019 - openaccess.thecvf.com
We propose a simple, fast, and accurate one-stage approach to visual grounding, inspired
by the following insight. The performances of existing propose-and-rank two-stage methods …

Sat: 2d semantics assisted training for 3d visual grounding

Z Yang, S Zhang, L Wang, J Luo - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract 3D visual grounding aims at grounding a natural language description about a 3D
scene, usually represented in the form of 3D point clouds, to the targeted object region. Point …

[PDF][PDF] Deep image retrieval: A survey

W Chen, Y Liu, W Wang… - arxiv preprint …, 2021 - scholarlypublications …
In recent years a vast amount of visual content has been generated and shared from various
fields, such as social media platforms, medical images, and robotics. This abundance of …

Look around and refer: 2d synthetic semantics knowledge distillation for 3d visual grounding

E Bakr, Y Alsaedy, M Elhoseiny - Advances in neural …, 2022 - proceedings.neurips.cc
Abstract 3D visual grounding task has been explored with visual and language streams to
comprehend referential language for identifying targeted objects in 3D scenes. However …

Study of object detection based on Faster R-CNN

B Liu, W Zhao, Q Sun - 2017 Chinese automation congress …, 2017 - ieeexplore.ieee.org
Faster R-CNN (R corresponds to “Region”) which combined the RPN network and the Fast
R-CNN network is one of the best ways to object detection of R-CNN series based on deep …

Design and implementation of real-time object detection system based on single-shoot detector and OpenCV

F Wahab, I Ullah, A Shah, RA Khan, A Choi… - Frontiers in …, 2022 - frontiersin.org
Computer vision (CV) and human–computer interaction (HCI) are essential in many
technological fields. Researchers in CV are particularly interested in real-time object …

One-shot instance segmentation

C Michaelis, I Ustyuzhaninov, M Bethge… - arxiv preprint arxiv …, 2018 - arxiv.org
We tackle the problem of one-shot instance segmentation: Given an example image of a
novel, previously unknown object category, find and segment all objects of this category …