Hierarchical fine-grained image forgery detection and localization
Differences in forgery attributes of images generated in CNN-synthesized and image-editing
domains are large, and such differences make a unified image forgery detection and …
Explicit visual prompting for low-level structure segmentations
We consider the generic problem of detecting low-level structures in images, which includes
segmenting the manipulated parts, identifying out-of-focus pixels, separating shadow …
BEVT: BERT pretraining of video transformers
This paper studies the BERT pretraining of video transformers. It is a straightforward but
worth-studying extension given the recent success from BERT pretraining of image …
AdaViT: Adaptive vision transformers for efficient image recognition
Built on top of self-attention mechanisms, vision transformers have demonstrated
remarkable performance on a variety of vision tasks recently. While achieving excellent …
Wave-ViT: Unifying wavelet and transformers for visual representation learning
Multi-scale Vision Transformer (ViT) has emerged as a powerful backbone for
computer vision tasks, while the self-attention computation in Transformer scales …
OmniTokenizer: A joint image-video tokenizer for visual generation
Tokenizer, serving as a translator to map the intricate visual data into a compact latent
space, lies at the core of visual generative models. Based on the finding that existing …
TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization
F Guillaro, D Cozzolino, A Sud… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper we present TruFor, a forensic framework that can be applied to a large variety
of image manipulation methods, from classic cheapfakes to more recent manipulations …
M2TR: Multi-modal multi-scale transformers for deepfake detection
The widespread dissemination of Deepfakes demands effective approaches that can detect
perceptually convincing forged images. In this paper, we aim to capture the subtle …
Look before you match: Instance understanding matters in video object segmentation
Exploring dense matching between the current frame and past frames for long-range context
modeling, memory-based methods have demonstrated impressive results in video object …
Edge-aware regional message passing controller for image forgery localization
Digital image authenticity has promoted research on image forgery localization. Although
deep learning-based methods achieve remarkable progress, most of them usually suffer …