Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A review of modern recommender systems using generative models (gen-recsys)
Traditional recommender systems typically use user-item rating histories as their main data
source. However, deep generative models now have the capability to model and sample …
source. However, deep generative models now have the capability to model and sample …
Tabpedia: Towards comprehensive visual table understanding with concept synergy
Tables contain factual and quantitative data accompanied by various structures and
contents that pose challenges for machine comprehension. Previous methods generally …
contents that pose challenges for machine comprehension. Previous methods generally …
Recommendation with generative models
Generative models are a class of AI models capable of creating new instances of data by
learning and sampling from their statistical distributions. In recent years, these models have …
learning and sampling from their statistical distributions. In recent years, these models have …
Leveraging temporal contextualization for video action recognition
We propose a novel framework for video understanding, called Temporally Contextualized
CLIP (TC-CLIP), which leverages essential temporal information through global interactions …
CLIP (TC-CLIP), which leverages essential temporal information through global interactions …
Rethinking clip-based video learners in cross-domain open-vocabulary action recognition
Building upon the impressive success of CLIP (Contrastive Language-Image Pretraining),
recent pioneer works have proposed to adapt the powerful CLIP to video data, leading to …
recent pioneer works have proposed to adapt the powerful CLIP to video data, leading to …
Foundation models for video understanding: A survey
Video Foundation Models (ViFMs) aim to develop general-purpose representations for
various video understanding tasks by leveraging large-scale datasets and powerful models …
various video understanding tasks by leveraging large-scale datasets and powerful models …
Awt: Transferring vision-language models via augmentation, weighting, and transportation
Pre-trained vision-language models (VLMs) have shown impressive results in various visual
classification tasks. However, we often fail to fully unleash their potential when adapting …
classification tasks. However, we often fail to fully unleash their potential when adapting …
Llavidal: Benchmarking large language vision models for daily activities of living
Large Language Vision Models (LLVMs) have demonstrated effectiveness in processing
internet videos, yet they struggle with the visually perplexing dynamics present in Activities …
internet videos, yet they struggle with the visually perplexing dynamics present in Activities …
Mote: Reconciling generalization with specialization for visual-language to video knowledge transfer
Transferring visual-language knowledge from large-scale foundation models for video
recognition has proved to be effective. To bridge the domain gap, additional parametric …
recognition has proved to be effective. To bridge the domain gap, additional parametric …
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
In rapidly evolving field of vision-language models (VLMs), contrastive language-image pre-
training (CLIP) has made significant strides, becoming foundation for various downstream …
training (CLIP) has made significant strides, becoming foundation for various downstream …