Google 학술 검색

MH Guo, TX Xu, JJ Liu, ZN Liu, PT Jiang, TJ Mu… - Computational visual …, 2022 - Springer

Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …

저장 인용 1898회 인용 관련 학술자료 전체 8개의 버전

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] 2D and 3D object detection algorithms from images: A Survey

W Chen, Y Li, Z Tian, F Zhang - Array, 2023 - Elsevier

Object detection is a crucial branch of computer vision that aims to locate and classify
objects in images. Using deep convolutional neural networks (CNNs) as the primary …

저장 인용 53회 인용 관련 학술자료

[Free GPT-4]

[PDF] arxiv.org

Grounding dino: Marrying dino with grounded pre-training for open-set object detection

S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang… - … on Computer Vision, 2024 - Springer

In this paper, we develop an open-set object detector, called Grounding DINO, by marrying
Transformer-based detector DINO with grounded pre-training, which can detect arbitrary …

저장 인용 1588회 인용 관련 학술자료 전체 4개의 버전

[Free GPT-4]

[PDF] thecvf.com

Detrs beat yolos on real-time object detection

Y Zhao, W Lv, S Xu, J Wei, G Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

The YOLO series has become the most popular framework for real-time object detection due
to its reasonable trade-off between speed and accuracy. However we observe that the …

[Free GPT-4]

[PDF] thecvf.com

Diffusiondet: Diffusion model for object detection

S Chen, P Sun, Y Song, P Luo - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We propose DiffusionDet, a new framework that formulates object detection as a denoising
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …

저장 인용 485회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

[Free GPT-4]

[PDF] thecvf.com

Oneformer: One transformer to rule universal image segmentation

J Jain, J Li, MT Chiu, A Hassani… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Universal Image Segmentation is not a new concept. Past attempts to unify image
segmentation include scene parsing, panoptic segmentation, and, more recently, new …

저장 인용 376회 인용 관련 학술자료 전체 8개의 버전 HTML 버전

[Free GPT-4]

[PDF] thecvf.com

A simple framework for open-vocabulary segmentation and detection

H Zhang, F Li, X Zou, S Liu, C Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this work, we present OpenSeeD, a simple Open-vocabulary Segmentation and Detection
framework that learns from different segmentation and detection datasets. To bridge the gap …

저장 인용 160회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

[Free GPT-4]

[PDF] thecvf.com

Detrs with collaborative hybrid assignments training

Z Zong, G Song, Y Liu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

In this paper, we provide the observation that too few queries assigned as positive samples
in DETR with one-to-one set matching leads to sparse supervision on the encoder's output …

저장 인용 357회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

[Free GPT-4]

[PDF] openreview.net

Dino: Detr with improved denoising anchor boxes for end-to-end object detection

H Zhang, F Li, S Liu, L Zhang, H Su, J Zhu… - arxiv preprint arxiv …, 2022 - arxiv.org

We present DINO (\textbf {D} ETR with\textbf {I} mproved de\textbf {N} oising anch\textbf {O} r
boxes), a state-of-the-art end-to-end object detector.% in this paper. DINO improves over …

[Free GPT-4]

[PDF] thecvf.com

Mask dino: Towards a unified transformer-based framework for object detection and segmentation

F Li, H Zhang, H Xu, S Liu, L Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this paper we present Mask DINO, a unified object detection and segmentation
framework. Mask DINO extends DINO (DETR with Improved Denoising Anchor Boxes) by …

저장 인용 416회 인용 관련 학술자료 전체 5개의 버전 HTML 버전

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Dab-detr: Dynamic anchor boxes are better queries for detr

Attention mechanisms in computer vision: A survey

[HTML][HTML] 2D and 3D object detection algorithms from images: A Survey

Grounding dino: Marrying dino with grounded pre-training for open-set object detection

Detrs beat yolos on real-time object detection

Diffusiondet: Diffusion model for object detection

Oneformer: One transformer to rule universal image segmentation

A simple framework for open-vocabulary segmentation and detection

Detrs with collaborative hybrid assignments training

Dino: Detr with improved denoising anchor boxes for end-to-end object detection

Mask dino: Towards a unified transformer-based framework for object detection and segmentation