Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Speaker recognition by machines and humans: A tutorial review

JHL Hansen, T Hasan - IEEE Signal processing magazine, 2015 - ieeexplore.ieee.org
Identifying a person by his or her voice is an important human trait most take for granted in
natural human-to-human interaction/communication. Speaking to someone over the …

Agnostic federated learning

M Mohri, G Sivek, AT Suresh - International conference on …, 2019 - proceedings.mlr.press
A key learning scenario in large-scale applications is that of federated learning, where a
centralized model is trained based on data originating from a large number of clients. We …

Speaker verification using adapted Gaussian mixture models

DA Reynolds, TF Quatieri, RB Dunn - Digital signal processing, 2000 - Elsevier
Reynolds, Douglas A., Quatieri, Thomas F., and Dunn, Robert B., Speaker Verification Using
Adapted Gaussian Mixture Models, Digital Signal Processing10 (2000), 19–41. In this paper …

Springer Series in Statistics

P Bickel, P Diggle, S Fienberg, U Gather - 2005 - Springer
Hidden Markov models—most often abbreviated to the acronym “HMMs”—are one of the
most successful statistical modelling ideas that have came up in the last forty years: the use …

[PDF][PDF] Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models

CJ Leggetter, PC Woodland - Computer speech & language, 1995 - eecs.yorku.ca
A method of speaker adaptation for continuous density hidden Markov models (HMMs) is
presented. An initial speaker-independent system is adapted to improve the modelling of a …

Maximum likelihood linear transformations for HMM-based speech recognition

MJF Gales - Computer speech & language, 1998 - Elsevier
This paper examines the application of linear transformations for speaker and environmental
adaptation in an HMM-based speech recognition system. In particular, transformations that …

Statistical parametric speech synthesis

H Zen, K Tokuda, AW Black - speech communication, 2009 - Elsevier
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.

DF Kleinschmidt, TF Jaeger - Psychological review, 2015 - psycnet.apa.org
Successful speech perception requires that listeners map the acoustic signal to linguistic
categories. These map**s are not only probabilistic, but change depending on the …