Portrait video editing empowered by multimodal generative priors

X Gao, H **ao, C Zhong, S Hu, Y Guo… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
We introduce PortraitGen, a powerful portrait video editing method that achieves consistent
and expressive stylization with multimodal prompts. Traditional portrait video editing …

FRoundation: Are Foundation Models Ready for Face Recognition?

T Chettaoui, N Damer, F Boutros - arxiv preprint arxiv:2410.23831, 2024 - arxiv.org
Foundation models are predominantly trained in an unsupervised or self-supervised manner
on highly diverse and large-scale datasets, making them broadly applicable to various …

Protecting privacy in multimodal large language models with mllmu-bench

Z Liu, G Dou, M Jia, Z Tan, Q Zeng, Y Yuan… - arxiv preprint arxiv …, 2024 - arxiv.org
Generative models such as Large Language Models (LLM) and Multimodal Large
Language models (MLLMs) trained on massive web corpora can memorize and disclose …

HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere

HO Shahreza, S Marcel - arxiv preprint arxiv:2411.08470, 2024 - arxiv.org
Face recognition datasets are often collected by crawling Internet and without individuals'
consents, raising ethical and privacy concerns. Generating synthetic datasets for training …

Vec2Face: Scaling face dataset generation with loosely constrained vectors

H Wu, J Singh, S Tian, L Zheng, KW Bowyer - arxiv preprint arxiv …, 2024 - arxiv.org
This paper studies how to synthesize face images of non-existent persons, to create a
dataset that allows effective training of face recognition (FR) models. Two important goals …

Digi2real: Bridging the realism gap in synthetic data face recognition via foundation models

A George, S Marcel - arxiv preprint arxiv:2411.02188, 2024 - arxiv.org
The accuracy of face recognition systems has improved significantly in the past few years,
thanks to the large amount of data collected and advancements in neural network …

Omni-ID: Holistic Identity Representation Designed for Generative Tasks

G Qian, KC Wang, O Patashnik, N Heravi… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce Omni-ID, a novel facial representation designed specifically for generative
tasks. Omni-ID encodes holistic information about an individual's appearance across diverse …

MONOT: High-Quality Privacy-compliant Morphed Synthetic Images for Everyone

G Borghi, N Di Domenico, M Ferrara… - … and Security (WIFS), 2024 - ieeexplore.ieee.org
Morphing Attack Detection (MAD) is a critical task in biometric security, aimed at identifying
and mitigating the risks posed by morphing attacks, where a face image is manipulated to …

Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance

D Gerogiannis, FP Papantoniou, RA Potamias… - arxiv preprint arxiv …, 2025 - arxiv.org
Inspired by the effectiveness of 3D Gaussian Splatting (3DGS) in reconstructing detailed 3D
scenes within multi-view setups and the emergence of large 2D human foundation models …

MADation: Face Morphing Attack Detection with Foundation Models

E Caldeira, G Ozgur, T Chettaoui, M Ivanovska… - arxiv preprint arxiv …, 2025 - arxiv.org
Despite the considerable performance improvements of face recognition algorithms in
recent years, the same scientific advances responsible for this progress can also be used to …