V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

X Zhang, Y Xu, R Li, J Yu, W Li, Z Xu… - Proceedings of the 32nd …, 2024 - dl.acm.org
AI-generated video has revolutionized short video production, filmmaking, and personalized
media, making video local editing an essential tool. However, this progress also blurs the …

Not my voice! A taxonomy of ethical and safety harms of speech generators

W Hutiri, O Papakyriakopoulos, A Xiang - Proceedings of the 2024 ACM …, 2024 - dl.acm.org
The rapid and wide-scale adoption of AI to generate human speech poses a range of
significant ethical and safety risks to society that need to be addressed. For example, a …

Images that sound: Composing images and sounds on a single canvas

Z Chen, D Geng, A Owens - Advances in Neural …, 2025 - proceedings.neurips.cc
Spectrograms are 2D representations of sound that look very different from the images found
in our visual world. And natural images, when played as spectrograms, make unnatural …

GS-Hider: Hiding messages into 3D Gaussian Splatting

X Zhang, J Meng, R Li, Z Xu… - Advances in Neural …, 2025 - proceedings.neurips.cc
3D Gaussian Splatting (3DGS) has already become the emerging research focus in
the fields of 3D scene reconstruction and novel view synthesis. Given that training a 3DGS …

Can Simple Averaging Defeat Modern Watermarks?

P Yang, H Ci, Y Song, MZ Shou - Advances in Neural …, 2025 - proceedings.neurips.cc
Digital watermarking techniques are crucial for copyright protection and source identification
of images, especially in the era of generative AI models. However, many existing …

Meta learning text-to-speech synthesis in over 7000 languages

F Lux, S Meyer, L Behringer, F Zalkow, P Do… - arXiv preprint arXiv …, 2024 - arxiv.org
In this work, we take on the challenging task of building a single text-to-speech synthesis
system that is capable of generating speech in over 7000 languages, many of which lack …

Audio codec augmentation for robust collaborative watermarking of speech synthesis

L Juvela, X Wang - arXiv preprint arXiv:2409.13382, 2024 - arxiv.org
Automatic detection of synthetic speech is becoming increasingly important as current
synthesis methods are both near indistinguishable from human speech and widely …

Steganalysis on Digital Watermarking: Is Your Defense Truly Impervious?

P Yang, H Ci, Y Song, MZ Shou - arXiv preprint arXiv:2406.09026, 2024 - arxiv.org
Digital watermarking techniques are crucial for copyright protection and source identification
of images, especially in the era of generative AI models. However, many existing …

GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis

W Liu, Y Li, D Lin, H Tian, H Li - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
Amid the burgeoning development of generative models like diffusion models, the task of
differentiating synthesized audio from its natural counterpart grows more daunting …

SoK: Watermarking for AI-Generated Content

X Zhao, S Gunn, M Christ, J Fairoze, A Fabrega… - arXiv preprint arXiv …, 2024 - arxiv.org
As the outputs of generative AI (GenAI) techniques improve in quality, it becomes
increasingly challenging to distinguish them from human-created content. Watermarking …