Hierarchical fine-grained image forgery detection and localization

X Guo, X Liu, Z Ren, S Grosz… - Proceedings of the …, 2023‏ - openaccess.thecvf.com
Differences in forgery attributes of images generated in CNN-synthesized and image-editing
domains are large, and such differences make a unified image forgery detection and …

Explicit visual prompting for low-level structure segmentations

W Liu, X Shen, CM Pun, X Cun - Proceedings of the IEEE …, 2023‏ - openaccess.thecvf.com
We consider the generic problem of detecting low-level structures in images, which includes
segmenting the manipulated parts, identifying out-of-focus pixels, separating shadow …

Bevt: Bert pretraining of video transformers

R Wang, D Chen, Z Wu, Y Chen… - Proceedings of the …, 2022‏ - openaccess.thecvf.com
This paper studies the BERT pretraining of video transformers. It is a straightforward but
worth-studying extension given the recent success from BERT pretraining of image …

Adavit: Adaptive vision transformers for efficient image recognition

L Meng, H Li, BC Chen, S Lan, Z Wu… - Proceedings of the …, 2022‏ - openaccess.thecvf.com
Built on top of self-attention mechanisms, vision transformers have demonstrated
remarkable performance on a variety of vision tasks recently. While achieving excellent …

Wave-vit: Unifying wavelet and transformers for visual representation learning

T Yao, Y Pan, Y Li, CW Ngo, T Mei - European Conference on Computer …, 2022‏ - Springer
Abstract Multi-scale Vision Transformer (ViT) has emerged as a powerful backbone for
computer vision tasks, while the self-attention computation in Transformer scales …

Omnitokenizer: A joint image-video tokenizer for visual generation

J Wang, Y Jiang, Z Yuan, B Peng… - Advances in Neural …, 2025‏ - proceedings.neurips.cc
Tokenizer, serving as a translator to map the intricate visual data into a compact latent
space, lies at the core of visual generative models. Based on the finding that existing …

Trufor: Leveraging all-round clues for trustworthy image forgery detection and localization

F Guillaro, D Cozzolino, A Sud… - Proceedings of the …, 2023‏ - openaccess.thecvf.com
In this paper we present TruFor, a forensic framework that can be applied to a large variety
of image manipulation methods, from classic cheapfakes to more recent manipulations …

M2tr: Multi-modal multi-scale transformers for deepfake detection

J Wang, Z Wu, W Ouyang, X Han, J Chen… - Proceedings of the …, 2022‏ - dl.acm.org
The widespread dissemination of Deepfakes demands effective approaches that can detect
perceptually convincing forged images. In this paper, we aim to capture the subtle …

Look before you match: Instance understanding matters in video object segmentation

J Wang, D Chen, Z Wu, C Luo, C Tang… - Proceedings of the …, 2023‏ - openaccess.thecvf.com
Exploring dense matching between the current frame and past frames for long-range context
modeling, memory-based methods have demonstrated impressive results in video object …

Edge-aware regional message passing controller for image forgery localization

D Li, J Zhu, M Wang, J Liu, X Fu… - Proceedings of the …, 2023‏ - openaccess.thecvf.com
Digital image authenticity has promoted research on image forgery localization. Although
deep learning-based methods achieve remarkable progress, most of them usually suffer …