- Academic Search

PP Liang, A Zadeh, LP Morency - ACM Computing Surveys, 2024‏ - dl.acm.org‏

Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …‏

שמור צטט צוטט על ידי 99 מאמרים בנושא זה

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?‏

C Zhang, C Zhang, S Zheng, Y Qiao, C Li… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …‏

שמור צטט צוטט על ידי 210 מאמרים בנושא זה כל 4 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Gaussianavatars: Photorealistic head avatars with rigged 3d gaussians‏

S Qian, T Kirschstein, L Schoneveld… - Proceedings of the …, 2024‏ - openaccess.thecvf.com‏

We introduce GaussianAvatars a new method to create photorealistic head avatars that are
fully controllable in terms of expression pose and viewpoint. The core idea is a dynamic 3D …‏

שמור צטט צוטט על ידי 121 מאמרים בנושא זה כל 7 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Vasa-1: Lifelike audio-driven talking faces generated in real time‏

S Xu, G Chen, YX Guo, J Yang, C Li… - Advances in …, 2025‏ - proceedings.neurips.cc‏

We introduce VASA, a framework for generating lifelike talking faces with appealing visual
affective skills (VAS) given a single static image and a speech audio clip. Our premiere …‏

שמור צטט צוטט על ידי 62 מאמרים בנושא זה כל 5 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

End-to-end reconstruction-classification learning for face forgery detection‏

J Cao, C Ma, T Yao, S Chen… - Proceedings of the …, 2022‏ - openaccess.thecvf.com‏

Existing face forgery detectors mainly focus on specific forgery patterns like noise
characteristics, local textures, or frequency statistics for forgery detection. This causes …‏

שמור צטט צוטט על ידי 258 מאמרים בנושא זה כל 5 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Diffused heads: Diffusion models beat gans on talking-face generation‏

M Stypułkowski, K Vougioukas, S He… - Proceedings of the …, 2024‏ - openaccess.thecvf.com‏

Talking face generation has historically struggled to produce head movements and natural
facial expressions without guidance from additional reference videos. Recent developments …‏

שמור צטט צוטט על ידי 140 מאמרים בנושא זה כל 6 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Deepfake detection: A systematic literature review‏

MS Rana, MN Nobi, B Murali, AH Sung - IEEE access, 2022‏ - ieeexplore.ieee.org‏

Over the last few decades, rapid progress in AI, machine learning, and deep learning has
resulted in new techniques and various tools for manipulating multimedia. Though the …‏

שמור צטט צוטט על ידי 318 מאמרים בנושא זה כל 6 הגרסאות

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Dynamic graph learning with content-guided spatial-frequency relation reasoning for deepfake detection‏

Y Wang, K Yu, C Chen, X Hu… - Proceedings of the IEEE …, 2023‏ - openaccess.thecvf.com‏

With the springing up of face synthesis techniques, it is prominent in need to develop
powerful face forgery detection methods due to security concerns. Some existing methods …‏

שמור צטט צוטט על ידי 93 מאמרים בנושא זה כל 4 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] pnas.org Full View‏

AI-synthesized faces are indistinguishable from real faces and more trustworthy‏

SJ Nightingale, H Farid - Proceedings of the National Academy of …, 2022‏ - pnas.org‏

Artificial intelligence (AI)–synthesized text, audio, image, and video are being weaponized
for the purposes of nonconsensual intimate imagery, financial fraud, and disinformation …‏

שמור צטט צוטט על ידי 299 מאמרים בנושא זה כל 10 הגרסאות

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Difftalk: Crafting diffusion models for generalized audio-driven portraits animation‏

S Shen, W Zhao, Z Meng, W Li… - Proceedings of the …, 2023‏ - openaccess.thecvf.com‏

Talking head synthesis is a promising approach for the video production industry. Recently,
a lot of effort has been devoted in this research area to improve the generation quality or …‏

שמור צטט צוטט על ידי 111 מאמרים בנושא זה כל 8 הגרסאות פתיחה בתור HTML

יצירת התראה

צטט

חיפוש מתקדם

נשמר בספרייה שלי

Synthesizing obama: learning lip sync from audio

Foundations & trends in multimodal machine learning: Principles, challenges, and open questions‏

A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?‏

Gaussianavatars: Photorealistic head avatars with rigged 3d gaussians‏

Vasa-1: Lifelike audio-driven talking faces generated in real time‏

End-to-end reconstruction-classification learning for face forgery detection‏

Diffused heads: Diffusion models beat gans on talking-face generation‏

Deepfake detection: A systematic literature review‏

Dynamic graph learning with content-guided spatial-frequency relation reasoning for deepfake detection‏

AI-synthesized faces are indistinguishable from real faces and more trustworthy‏

Difftalk: Crafting diffusion models for generalized audio-driven portraits animation‏