Google Академія

[HTML][HTML] A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas

J Terven, DM Córdova-Esparza… - Machine learning and …, 2023 - mdpi.com

YOLO has become a central real-time object detection system for robotics, driverless cars,
and video monitoring applications. We present a comprehensive analysis of YOLO's …

Зберегти Послатися Цитовано в 1998 джерелах Пов’язані статті Кількість версій: 7 Кеш

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Instruction tuning for large language models: A survey

S Zhang, L Dong, X Li, S Zhang, X Sun, S Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

This paper surveys research works in the quickly advancing field of instruction tuning (IT),
which can also be referred to as supervised fine-tuning (SFT)\footnote {In this paper, unless …

Зберегти Послатися Цитовано в 742 джерелах Пов’язані статті Кількість версій: 5 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] ulsan.ac.kr

Yolov9: Learning what you want to learn using programmable gradient information

CY Wang, IH Yeh, HY Mark Liao - European conference on computer …, 2024 - Springer

Today's deep learning methods focus on how to design the objective functions to make the
prediction as close as possible to the target. Meanwhile, an appropriate neural network …

Зберегти Послатися Цитовано в 1588 джерелах Пов’язані статті Кількість версій: 12

[Free GPT-4]
[DeepSeek]

[PDF] mdpi.com

YOLO-v1 to YOLO-v8, the rise of YOLO and its complementary nature toward digital manufacturing and industrial defect detection

M Hussain - Machines, 2023 - mdpi.com

Since its inception in 2015, the YOLO (You Only Look Once) variant of object detectors has
rapidly grown, with the latest release of YOLO-v8 in January 2023. YOLO variants are …

Зберегти Послатися Цитовано в 582 джерелах Пов’язані статті Кількість версій: 5 Кеш

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Open-vocabulary panoptic segmentation with text-to-image diffusion models

J Xu, S Liu, A Vahdat, W Byeon… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present ODISE: Open-vocabulary DIffusion-based panoptic SEgmentation, which unifies
pre-trained text-image diffusion and discriminative models to perform open-vocabulary …

Зберегти Послатися Цитовано в 430 джерелах Пов’язані статті Кількість версій: 8 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Sam 2: Segment anything in images and videos

N Ravi, V Gabeur, YT Hu, R Hu, C Ryali, T Ma… - arxiv preprint arxiv …, 2024 - arxiv.org

We present Segment Anything Model 2 (SAM 2), a foundation model towards solving
promptable visual segmentation in images and videos. We build a data engine, which …

Зберегти Послатися Цитовано в 450 джерелах Пов’язані статті Кількість версій: 2 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Visual autoregressive modeling: Scalable image generation via next-scale prediction

K Tian, Y Jiang, Z Yuan, B Peng… - Advances in neural …, 2025 - proceedings.neurips.cc

Abstract We present Visual AutoRegressive modeling (VAR), a new generation paradigm
that redefines the autoregressive learning on images as coarse-to-fine" next-scale …

Зберегти Послатися Цитовано в 153 джерелах Пов’язані статті Кількість версій: 5 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Yolo-world: Real-time open-vocabulary object detection

T Cheng, L Song, Y Ge, W Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract The You Only Look Once (YOLO) series of detectors have established themselves
as efficient and practical tools. However their reliance on predefined and trained object …

Зберегти Послатися Цитовано в 230 джерелах Пов’язані статті Кількість версій: 7 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

A review of convolutional neural networks in computer vision

X Zhao, L Wang, Y Zhang, X Han, M Deveci… - Artificial Intelligence …, 2024 - Springer

In computer vision, a series of exemplary advances have been made in several areas
involving image classification, semantic segmentation, object detection, and image super …

Зберегти Послатися Цитовано в 198 джерелах Пов’язані статті Кількість версій: 4

Gold-YOLO: Efficient object detector via gather-and-distribute mechanism

C Wang, W He, Y Nie, J Guo, C Liu… - Advances in Neural …, 2024 - proceedings.neurips.cc

In the past years, YOLO-series models have emerged as the leading approaches in the area
of real-time object detection. Many studies pushed up the baseline to a higher level by …

Зберегти Послатися Цитовано в 271 джерелах Пов’язані статті Кількість версій: 6 Кеш

Створити сповіщення

Послатися

Розширений пошук

Збережено в моїй бібліотеці

Feature pyramid networks for object detection

[HTML][HTML] A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas

Instruction tuning for large language models: A survey

Yolov9: Learning what you want to learn using programmable gradient information

YOLO-v1 to YOLO-v8, the rise of YOLO and its complementary nature toward digital manufacturing and industrial defect detection

Open-vocabulary panoptic segmentation with text-to-image diffusion models

Sam 2: Segment anything in images and videos

Visual autoregressive modeling: Scalable image generation via next-scale prediction

Yolo-world: Real-time open-vocabulary object detection

A review of convolutional neural networks in computer vision

Gold-YOLO: Efficient object detector via gather-and-distribute mechanism