Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on video diffusion models
The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …
Simda: Simple diffusion adapter for efficient video generation
The recent wave of AI-generated content has witnessed the great development and success
of Text-to-Image (T2I) technologies. By contrast Text-to-Video (T2V) still falls short of …
of Text-to-Image (T2I) technologies. By contrast Text-to-Video (T2V) still falls short of …
Masked video distillation: Rethinking masked feature modeling for self-supervised video representation learning
Benefiting from masked visual modeling, self-supervised video representation learning has
achieved remarkable progress. However, existing methods focus on learning …
achieved remarkable progress. However, existing methods focus on learning …
Prototypical residual networks for anomaly detection and localization
Anomaly detection and localization are widely used in industrial manufacturing for its
efficiency and effectiveness. Anomalies are rare and hard to collect and supervised models …
efficiency and effectiveness. Anomalies are rare and hard to collect and supervised models …
Implicit temporal modeling with learnable alignment for video recognition
Contrastive language-image pretraining (CLIP) has demonstrated remarkable success in
various image tasks. However, how to extend CLIP with effective temporal modeling is still …
various image tasks. However, how to extend CLIP with effective temporal modeling is still …
Open-vclip: Transforming clip to an open-vocabulary video model via interpolated weight optimization
Abstract Contrastive Language-Image Pretraining (CLIP) has demonstrated impressive zero-
shot learning abilities for image understanding, yet limited effort has been made to …
shot learning abilities for image understanding, yet limited effort has been made to …
Motioneditor: Editing video motion via content-aware diffusion
Existing diffusion-based video editing models have made gorgeous advances for editing
attributes of a source video over time but struggle to manipulate the motion information while …
attributes of a source video over time but struggle to manipulate the motion information while …
XVO: Generalized visual odometry via cross-modal self-training
We propose XVO, a semi-supervised learning method for training generalized monocular
Visual Odometry (VO) models with robust off-the-self operation across diverse datasets and …
Visual Odometry (VO) models with robust off-the-self operation across diverse datasets and …
vid-tldr: Training free token merging for light-weight video transformer
Video Transformers have become the prevalent solution for various video downstream tasks
with superior expressive power and flexibility. However these video transformers suffer from …
with superior expressive power and flexibility. However these video transformers suffer from …
Clip-tsa: Clip-assisted temporal self-attention for weakly-supervised video anomaly detection
Video anomaly detection (VAD)–commonly formulated as a multiple-instance learning
problem in a weakly-supervised manner due to its labor-intensive nature–is a challenging …
problem in a weakly-supervised manner due to its labor-intensive nature–is a challenging …