Self-supervised speech representation learning: A review
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …
necessitated the building of specialist models for individual tasks and application scenarios …
Speaker recognition by machines and humans: A tutorial review
Identifying a person by his or her voice is an important human trait most take for granted in
natural human-to-human interaction/communication. Speaking to someone over the …
natural human-to-human interaction/communication. Speaking to someone over the …
Agnostic federated learning
A key learning scenario in large-scale applications is that of federated learning, where a
centralized model is trained based on data originating from a large number of clients. We …
centralized model is trained based on data originating from a large number of clients. We …
Speaker verification using adapted Gaussian mixture models
Reynolds, Douglas A., Quatieri, Thomas F., and Dunn, Robert B., Speaker Verification Using
Adapted Gaussian Mixture Models, Digital Signal Processing10 (2000), 19–41. In this paper …
Adapted Gaussian Mixture Models, Digital Signal Processing10 (2000), 19–41. In this paper …
Springer Series in Statistics
Hidden Markov models—most often abbreviated to the acronym “HMMs”—are one of the
most successful statistical modelling ideas that have came up in the last forty years: the use …
most successful statistical modelling ideas that have came up in the last forty years: the use …
[PDF][PDF] Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
CJ Leggetter, PC Woodland - Computer speech & language, 1995 - eecs.yorku.ca
A method of speaker adaptation for continuous density hidden Markov models (HMMs) is
presented. An initial speaker-independent system is adapted to improve the modelling of a …
presented. An initial speaker-independent system is adapted to improve the modelling of a …
Maximum likelihood linear transformations for HMM-based speech recognition
MJF Gales - Computer speech & language, 1998 - Elsevier
This paper examines the application of linear transformations for speaker and environmental
adaptation in an HMM-based speech recognition system. In particular, transformations that …
adaptation in an HMM-based speech recognition system. In particular, transformations that …
Statistical parametric speech synthesis
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …
Spoofing and countermeasures for speaker verification: A survey
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …
the technology can be susceptible to malicious spoofing attacks. The research community …
Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.
Successful speech perception requires that listeners map the acoustic signal to linguistic
categories. These map**s are not only probabilistic, but change depending on the …
categories. These map**s are not only probabilistic, but change depending on the …