SIFT meets CNN: A decade survey of instance retrieval
In the early days, content-based image retrieval (CBIR) was studied with global features.
Since 2003, image retrieval based on local descriptors (de facto SIFT) has been extensively …
Deep learning for instance retrieval: A survey
In recent years a vast amount of visual content has been generated and shared from many
fields, such as social media platforms, medical imaging, and robotics. This abundance of …
YOLOv7-RAR for urban vehicle detection
Y Zhang, Y Sun, Z Wang, Y Jiang - Sensors, 2023 - mdpi.com
Aiming at the problems of high missed detection rates of the YOLOv7 algorithm for vehicle
detection on urban roads, weak perception of small targets in perspective, and insufficient …
A fast and accurate one-stage approach to visual grounding
We propose a simple, fast, and accurate one-stage approach to visual grounding, inspired
by the following insight. The performances of existing propose-and-rank two-stage methods …
Sat: 2d semantics assisted training for 3d visual grounding
Abstract 3D visual grounding aims at grounding a natural language description about a 3D
scene, usually represented in the form of 3D point clouds, to the targeted object region. Point …
Deep image retrieval: A survey
W Chen, Y Liu, W Wang… - arXiv preprint …, 2021 - scholarlypublications …
In recent years a vast amount of visual content has been generated and shared from various
fields, such as social media platforms, medical images, and robotics. This abundance of …
Look around and refer: 2d synthetic semantics knowledge distillation for 3d visual grounding
Abstract 3D visual grounding task has been explored with visual and language streams to
comprehend referential language for identifying targeted objects in 3D scenes. However …
Study of object detection based on Faster R-CNN
B Liu, W Zhao, Q Sun - 2017 Chinese automation congress …, 2017 - ieeexplore.ieee.org
Faster R-CNN (R corresponds to “Region”) which combined the RPN network and the Fast
R-CNN network is one of the best ways to object detection of R-CNN series based on deep …
Design and implementation of real-time object detection system based on single-shoot detector and OpenCV
Computer vision (CV) and human–computer interaction (HCI) are essential in many
technological fields. Researchers in CV are particularly interested in real-time object …
One-shot instance segmentation
We tackle the problem of one-shot instance segmentation: Given an example image of a
novel, previously unknown object category, find and segment all objects of this category …