An overview of text-independent speaker recognition: From features to supervectors
T Kinnunen, H Li - Speech communication, 2010 - Elsevier
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …
emphasis on text-independent recognition. Speaker recognition has been studied actively …
Event-based instantaneous fundamental frequency estimation from speech signals
B Yegnanarayana, KSR Murty - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org
Exploiting the impulse-like nature of excitation in the sequence of glottal cycles, a method is
proposed to derive the instantaneous fundamental frequency from speech signals. The …
proposed to derive the instantaneous fundamental frequency from speech signals. The …
Combining automatic speaker verification and prosody analysis for synthetic speech detection
The rapid spread of media content synthesis technology and the potentially damaging
impact of audio and video deepfakes on people's lives have raised the need to implement …
impact of audio and video deepfakes on people's lives have raised the need to implement …
Variational autoencoder for prosody‐based speaker recognition
This paper describes a novel end‐to‐end deep generative model‐based speaker
recognition system using prosodic features. The usefulness of variational autoencoders …
recognition system using prosodic features. The usefulness of variational autoencoders …
Automatic speaker verification system for dysarthric speakers using prosodic features and out-of-domain data augmentation
A communication disorder is an impairment of a person's ability to talk or communicate
appropriately. Dysarthria is a common neuro-motor speech communication disorder that can …
appropriately. Dysarthria is a common neuro-motor speech communication disorder that can …
Neural network based feature transformation for emotion independent speaker identification
SR Krothapalli, J Yadav, S Sarkar… - International Journal of …, 2012 - Springer
In this paper we are proposing neural network based feature transformation framework for
develo** emotion independent speaker identification system. Most of the present speaker …
develo** emotion independent speaker identification system. Most of the present speaker …
Privacy Versus Emotion Preservation Trade-Offs in Emotion-Preserving Speaker Anonymization
Z Cai, HL **nyuan, A Garg… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org
Advances in speech technology now allow unprecedented access to personally identifiable
information through speech. To protect such information, the differential privacy field has …
information through speech. To protect such information, the differential privacy field has …
Prosodic modelling based speaker identification
KN Boubakeur, M Debyeche… - … Conference on New …, 2022 - ieeexplore.ieee.org
The use of prosodic characteristics, mainly pitch and intensity, for speaker identification in
noisy environments is examined in this work. To make the acoustic models more resistant to …
noisy environments is examined in this work. To make the acoustic models more resistant to …
Analysis and detection of mimicked speech based on prosodic features
L Mary, KK Anish Babu, A Joseph - International Journal of Speech …, 2012 - Springer
This paper describes a work aimed towards understanding the art of mimicking by
professional mimicry artists while imitating the speech characteristics of known persons, and …
professional mimicry artists while imitating the speech characteristics of known persons, and …
Deeptalk: Vocal style encoding for speaker recognition and speech synthesis
Automatic speaker recognition algorithms typically characterize speech audio using short-
term spectral features that encode the physiological and anatomical aspects of speech …
term spectral features that encode the physiological and anatomical aspects of speech …