Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Recent advances on loss functions in deep learning for computer vision
The loss function, also known as cost function, is used for training a neural network or other
machine learning models. Over the past decade, researchers have designed many loss …
machine learning models. Over the past decade, researchers have designed many loss …
Vision transformers for dense prediction: A survey
S Zuo, Y **ao, X Chang, X Wang - Knowledge-based systems, 2022 - Elsevier
Transformers have demonstrated impressive expressiveness and transfer capability in
computer vision fields. Dense prediction is a fundamental problem in computer vision that is …
computer vision fields. Dense prediction is a fundamental problem in computer vision that is …
Vision transformers need registers
Transformers have recently emerged as a powerful tool for learning visual representations.
In this paper, we identify and characterize artifacts in feature maps of both supervised and …
In this paper, we identify and characterize artifacts in feature maps of both supervised and …
Simple open-vocabulary object detection
Combining simple architectures with large-scale pre-training has led to massive
improvements in image classification. For object detection, pre-training and scaling …
improvements in image classification. For object detection, pre-training and scaling …
Centralized feature pyramid for object detection
Y Quan, D Zhang, L Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The visual feature pyramid has shown its superiority in both effectiveness and efficiency in a
variety of applications. However, current methods overly focus on inter-layer feature …
variety of applications. However, current methods overly focus on inter-layer feature …
Conformer: Local features coupling global representations for visual recognition
Abstract Within Convolutional Neural Network (CNN), the convolution operations are good
at extracting local features but experience difficulty to capture global representations. Within …
at extracting local features but experience difficulty to capture global representations. Within …
Group detr: Fast detr training with group-wise one-to-many assignment
Detection transformer (DETR) relies on one-to-one assignment, assigning one ground-truth
object to one prediction, for end-to-end detection without NMS post-processing. It is known …
object to one prediction, for end-to-end detection without NMS post-processing. It is known …
Grit: Faster and better image captioning transformer using dual visual features
Current state-of-the-art methods for image captioning employ region-based features, as they
provide object-level information that is essential to describe the content of images; they are …
provide object-level information that is essential to describe the content of images; they are …
Vits for sits: Vision transformers for satellite image time series
M Tarasiou, E Chavez… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper we introduce the Temporo-Spatial Vision Transformer (TSViT), a fully-
attentional model for general Satellite Image Time Series (SITS) processing based on the …
attentional model for general Satellite Image Time Series (SITS) processing based on the …
Cascade-DETR: delving into high-quality universal object detection
Object localization in general environments is a fundamental part of vision systems. While
dominating on the COCO benchmark, recent Transformer-based detection methods are not …
dominating on the COCO benchmark, recent Transformer-based detection methods are not …