Attention mechanisms in computer vision: A survey
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …
this observation, attention mechanisms were introduced into computer vision with the aim of …
[HTML][HTML] 2D and 3D object detection algorithms from images: A Survey
W Chen, Y Li, Z Tian, F Zhang - Array, 2023 - Elsevier
Object detection is a crucial branch of computer vision that aims to locate and classify
objects in images. Using deep convolutional neural networks (CNNs) as the primary …
objects in images. Using deep convolutional neural networks (CNNs) as the primary …
Grounding dino: Marrying dino with grounded pre-training for open-set object detection
In this paper, we develop an open-set object detector, called Grounding DINO, by marrying
Transformer-based detector DINO with grounded pre-training, which can detect arbitrary …
Transformer-based detector DINO with grounded pre-training, which can detect arbitrary …
Detrs beat yolos on real-time object detection
The YOLO series has become the most popular framework for real-time object detection due
to its reasonable trade-off between speed and accuracy. However we observe that the …
to its reasonable trade-off between speed and accuracy. However we observe that the …
Diffusiondet: Diffusion model for object detection
We propose DiffusionDet, a new framework that formulates object detection as a denoising
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …
Oneformer: One transformer to rule universal image segmentation
Abstract Universal Image Segmentation is not a new concept. Past attempts to unify image
segmentation include scene parsing, panoptic segmentation, and, more recently, new …
segmentation include scene parsing, panoptic segmentation, and, more recently, new …
A simple framework for open-vocabulary segmentation and detection
In this work, we present OpenSeeD, a simple Open-vocabulary Segmentation and Detection
framework that learns from different segmentation and detection datasets. To bridge the gap …
framework that learns from different segmentation and detection datasets. To bridge the gap …
Detrs with collaborative hybrid assignments training
In this paper, we provide the observation that too few queries assigned as positive samples
in DETR with one-to-one set matching leads to sparse supervision on the encoder's output …
in DETR with one-to-one set matching leads to sparse supervision on the encoder's output …
Dino: Detr with improved denoising anchor boxes for end-to-end object detection
We present DINO (\textbf {D} ETR with\textbf {I} mproved de\textbf {N} oising anch\textbf {O} r
boxes), a state-of-the-art end-to-end object detector.% in this paper. DINO improves over …
boxes), a state-of-the-art end-to-end object detector.% in this paper. DINO improves over …
Mask dino: Towards a unified transformer-based framework for object detection and segmentation
In this paper we present Mask DINO, a unified object detection and segmentation
framework. Mask DINO extends DINO (DETR with Improved Denoising Anchor Boxes) by …
framework. Mask DINO extends DINO (DETR with Improved Denoising Anchor Boxes) by …