Google Scholar

X Guo, X Liu, Z Ren, S Grosz… - Proceedings of the …, 2023 - openaccess.thecvf.com

Differences in forgery attributes of images generated in CNN-synthesized and image-editing
domains are large, and such differences make a unified image forgery detection and …

Cited by 131. All 12 versions.

Explicit visual prompting for low-level structure segmentations

W Liu, X Shen, CM Pun, X Cun - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We consider the generic problem of detecting low-level structures in images, which includes
segmenting the manipulated parts, identifying out-of-focus pixels, separating shadow …

Cited by 134. All 6 versions.

BEVT: BERT pretraining of video transformers

R Wang, D Chen, Z Wu, Y Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

This paper studies the BERT pretraining of video transformers. It is a straightforward but
worth-studying extension given the recent success from BERT pretraining of image …

Cited by 259. All 6 versions.

AdaViT: Adaptive vision transformers for efficient image recognition

L Meng, H Li, BC Chen, S Lan, Z Wu… - Proceedings of the …, 2022 - openaccess.thecvf.com

Built on top of self-attention mechanisms, vision transformers have demonstrated
remarkable performance on a variety of vision tasks recently. While achieving excellent …

Cited by 262. All 5 versions.

Wave-ViT: Unifying wavelet and transformers for visual representation learning

T Yao, Y Pan, Y Li, CW Ngo, T Mei - European Conference on Computer …, 2022 - Springer

Multi-scale Vision Transformer (ViT) has emerged as a powerful backbone for
computer vision tasks, while the self-attention computation in Transformer scales …

Cited by 166. All 7 versions.

OmniTokenizer: A joint image-video tokenizer for visual generation

J Wang, Y Jiang, Z Yuan, B Peng… - Advances in Neural …, 2025 - proceedings.neurips.cc

Tokenizer, serving as a translator to map the intricate visual data into a compact latent
space, lies at the core of visual generative models. Based on the finding that existing …

Cited by 18. All 2 versions.

TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization

F Guillaro, D Cozzolino, A Sud… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this paper we present TruFor, a forensic framework that can be applied to a large variety
of image manipulation methods, from classic cheapfakes to more recent manipulations …

Cited by 121. All 5 versions.

M2TR: Multi-modal multi-scale transformers for deepfake detection

J Wang, Z Wu, W Ouyang, X Han, J Chen… - Proceedings of the …, 2022 - dl.acm.org

The widespread dissemination of Deepfakes demands effective approaches that can detect
perceptually convincing forged images. In this paper, we aim to capture the subtle …

Cited by 290. All 5 versions.

Look before you match: Instance understanding matters in video object segmentation

J Wang, D Chen, Z Wu, C Luo, C Tang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Exploring dense matching between the current frame and past frames for long-range context
modeling, memory-based methods have demonstrated impressive results in video object …

Cited by 51. All 5 versions.

Edge-aware regional message passing controller for image forgery localization

D Li, J Zhu, M Wang, J Liu, X Fu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Digital image authenticity has promoted research on image forgery localization. Although
deep learning-based methods achieve remarkable progress, most of them usually suffer …

Cited by 39. All 4 versions.
