An overview of text-independent speaker recognition: From features to supervectors

T Kinnunen, H Li - Speech communication, 2010 - Elsevier
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …

Event-based instantaneous fundamental frequency estimation from speech signals

B Yegnanarayana, KSR Murty - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org
Exploiting the impulse-like nature of excitation in the sequence of glottal cycles, a method is
proposed to derive the instantaneous fundamental frequency from speech signals. The …

Combining automatic speaker verification and prosody analysis for synthetic speech detection

L Attorresi, D Salvi, C Borrelli, P Bestagini… - … Conference on Pattern …, 2022 - Springer
The rapid spread of media content synthesis technology and the potentially damaging
impact of audio and video deepfakes on people's lives have raised the need to implement …

Variational autoencoder for prosody‐based speaker recognition

SB Alex, L Mary - ETRI Journal, 2023 - Wiley Online Library
This paper describes a novel end‐to‐end deep generative model‐based speaker
recognition system using prosodic features. The usefulness of variational autoencoders …

Automatic speaker verification system for dysarthric speakers using prosodic features and out-of-domain data augmentation

S Salim, S Shahnawazuddin, W Ahmad - Applied Acoustics, 2023 - Elsevier
A communication disorder is an impairment of a person's ability to talk or communicate
appropriately. Dysarthria is a common neuro-motor speech communication disorder that can …

Neural network based feature transformation for emotion independent speaker identification

SR Krothapalli, J Yadav, S Sarkar… - International Journal of …, 2012 - Springer
In this paper we are proposing neural network based feature transformation framework for
develo** emotion independent speaker identification system. Most of the present speaker …

Privacy Versus Emotion Preservation Trade-Offs in Emotion-Preserving Speaker Anonymization

Z Cai, HL **nyuan, A Garg… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org
Advances in speech technology now allow unprecedented access to personally identifiable
information through speech. To protect such information, the differential privacy field has …

Prosodic modelling based speaker identification

KN Boubakeur, M Debyeche… - … Conference on New …, 2022 - ieeexplore.ieee.org
The use of prosodic characteristics, mainly pitch and intensity, for speaker identification in
noisy environments is examined in this work. To make the acoustic models more resistant to …

Analysis and detection of mimicked speech based on prosodic features

L Mary, KK Anish Babu, A Joseph - International Journal of Speech …, 2012 - Springer
This paper describes a work aimed towards understanding the art of mimicking by
professional mimicry artists while imitating the speech characteristics of known persons, and …

Deeptalk: Vocal style encoding for speaker recognition and speech synthesis

A Chowdhury, A Ross, P David - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Automatic speaker recognition algorithms typically characterize speech audio using short-
term spectral features that encode the physiological and anatomical aspects of speech …