Vision-language pre-training: Basics, recent advances, and future trends
This monograph surveys vision-language pre-training (VLP) methods for multimodal
intelligence that have been developed in the last few years. We group these approaches …
Digital twin in the IoT context: A survey on technical features, scenarios, and architectural models
Digital twin (DT) is an emerging concept that is gaining attention in various industries. It
refers to the ability to clone a physical object (PO) into a software counterpart. The …
Open-vocabulary object detection via vision and language knowledge distillation
We aim at advancing open-vocabulary object detection, which detects objects described by
arbitrary text inputs. The fundamental challenge is the availability of training data. It is costly …
Learning concise and descriptive attributes for visual recognition
Recent advances in foundation models present new opportunities for interpretable visual
recognition--one can first query Large Language Models (LLMs) to obtain a set of attributes …
Align and prompt: Video-and-language pre-training with entity prompts
Video-and-language pre-training has shown promising improvements on various
downstream tasks. Most previous methods capture cross-modal interactions with a …
Elevater: A benchmark and toolkit for evaluating language-augmented visual models
Learning visual representations from natural language supervision has recently shown great
promise in a number of pioneering works. In general, these language-augmented visual …
Contrastive embedding for generalized zero-shot learning
Generalized zero-shot learning (GZSL) aims to recognize objects from both seen and
unseen classes, when only the labeled examples from seen classes are provided. Recent …
A review of generalized zero-shot learning methods
Generalized zero-shot learning (GZSL) aims to train a model for classifying data samples
under the condition that some output classes are unknown during supervised learning. To …
Progressive semantic-visual mutual adaption for generalized zero-shot learning
Generalized Zero-Shot Learning (GZSL) identifies unseen categories by knowledge
transferred from the seen domain, relying on the intrinsic interactions between visual and …
Counterfactual zero-shot and open-set visual recognition
We present a novel counterfactual framework for both Zero-Shot Learning (ZSL) and Open-
Set Recognition (OSR), whose common challenge is generalizing to the unseen-classes by …