An overview of recent work in media forensics: Methods and threats

K Bhagtani, AKS Yadav, ER Bartusiak, Z **ang… - arxiv preprint arxiv …, 2022 - arxiv.org
In this paper, we review recent work in media forensics for digital images, video, audio
(specifically speech), and documents. For each data modality, we discuss synthesis and …

Fairssd: Understanding bias in synthetic speech detectors

AKS Yadav, K Bhagtani, D Salvi… - Proceedings of the …, 2024 - openaccess.thecvf.com
Methods that can generate synthetic speech which is perceptually indistinguishable from
speech recorded by a human speaker are easily available. Several incidents report misuse …

Audio deepfake approaches

OA Shaaban, R Yildirim, AA Alguttar - IEEE Access, 2023 - ieeexplore.ieee.org
This paper presents a review of techniques involved in the creation and detection of audio
deepfakes, the first section provides information about general deep fakes. In the second …

Synthetic speech attribution using self supervised audio spectrogram transformer

AKS Yadav, ER Bartusiak, K Bhagtani… - Electronic …, 2023 - library.imaging.org
The ability to synthesize convincing human speech has become easier due to the
availability of speech generation tools. This necessitates the development of forensics …

Synthesized speech attribution using the patchout spectrogram attribution transformer

K Bhagtani, ER Bartusiak, AKS Yadav… - Proceedings of the …, 2023 - dl.acm.org
The malicious use of synthetic speech has increased with the recent availability of speech
generation tools. It is important to determine whether a speech signal is authentic (spoken …

Are Recent Deepfake Speech Generators Detectable?

K Bhagtani, AKS Yadav, P Bestagini… - Proceedings of the 2024 …, 2024 - dl.acm.org
Deep learning methods can generate high-quality synthetic speech which is perceptually
indistinguishable from real human speech. Synthetic speech can be maliciously used for …

Compression robust synthetic speech detection using patched spectrogram transformer

AKS Yadav, Z **ang, K Bhagtani, P Bestagini… - arxiv preprint arxiv …, 2024 - arxiv.org
Many deep learning synthetic speech generation tools are readily available. The use of
synthetic speech has caused financial fraud, impersonation of people, and misinformation to …

Transformer-based speech synthesizer attribution in an open set scenario

ER Bartusiak, EJ Delp - 2022 21st IEEE International …, 2022 - ieeexplore.ieee.org
Speech synthesis methods can create realistic-sounding speech, which may be used for
fraud, spoofing, and mis-information campaigns. Forensic methods that detect synthesized …

Dsvae: Interpretable disentangled representation for synthetic speech detection

AKS Yadav, K Bhagtani, Z **ang, P Bestagini… - arxiv preprint arxiv …, 2023 - arxiv.org
Tools to generate high quality synthetic speech signal that is perceptually indistinguishable
from speech recorded from human speakers are easily available. Several approaches have …

Transformer Ensemble for Synthesized Speech Detection

ER Bartusiak, K Bhagtani, AKS Yadav… - 2023 57th Asilomar …, 2023 - ieeexplore.ieee.org
As voice synthesis systems and deep learning tools continue to improve, so does the
possibility that synthesized speech can be used for nefarious purposes. Methods that …