A comprehensive review of YOLO architectures in computer vision: From YOLOv1 to YOLOv8 and YOLO-NAS

J Terven, DM Córdova-Esparza… - Machine learning and …, 2023 - mdpi.com
YOLO has become a central real-time object detection system for robotics, driverless cars,
and video monitoring applications. We present a comprehensive analysis of YOLO's …

A comprehensive survey on pretrained foundation models: A history from BERT to ChatGPT

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - International Journal of …, 2024 - Springer
Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks across different data modalities. A PFM (e.g., BERT, ChatGPT, GPT-4) is …

Segment anything

A Kirillov, E Mintun, N Ravi, H Mao… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce the Segment Anything (SA) project: a new task, model, and dataset for
image segmentation. Using our efficient model in a data collection loop, we built the largest …

MiniGPT-v2: Large language model as a unified interface for vision-language multi-task learning

J Chen, D Zhu, X Shen, X Li, Z Liu, P Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models have shown their remarkable capabilities as a general interface for
various language-related applications. Motivated by this, we target to build a unified …

UAV-YOLOv8: A small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios

G Wang, Y Chen, P An, H Hong, J Hu, T Huang - Sensors, 2023 - mdpi.com
Unmanned aerial vehicle (UAV) object detection plays a crucial role in civil, commercial, and
military domains. However, the high proportion of small objects in UAV images and the …

YOLO-World: Real-time open-vocabulary object detection

T Cheng, L Song, Y Ge, W Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
The You Only Look Once (YOLO) series of detectors have established themselves
as efficient and practical tools. However, their reliance on predefined and trained object …

A review of convolutional neural networks in computer vision

X Zhao, L Wang, Y Zhang, X Han, M Deveci… - Artificial Intelligence …, 2024 - Springer
In computer vision, a series of exemplary advances have been made in several areas
involving image classification, semantic segmentation, object detection, and image super …

SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence

W Fang, Y Chen, J Ding, Z Yu, T Masquelier… - Science …, 2023 - science.org
Spiking neural networks (SNNs) aim to realize brain-inspired intelligence on neuromorphic
chips with high energy efficiency by introducing neural dynamics and spike properties. As …

ConvNeXt V2: Co-designing and scaling ConvNets with masked autoencoders

S Woo, S Debnath, R Hu, X Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Driven by improved architectures and better representation learning frameworks, the field of
visual recognition has enjoyed rapid modernization and performance boost in the early …

AI for life: Trends in artificial intelligence for biotechnology

A Holzinger, K Keiblinger, P Holub, K Zatloukal… - New biotechnology, 2023 - Elsevier
Due to popular successes (e.g., ChatGPT), Artificial Intelligence (AI) is on everyone's lips
today. When advances in biotechnology are combined with advances in AI unprecedented …