A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Support vector machines using GMM supervectors for speaker verification

WM Campbell, DE Sturim… - IEEE signal processing …, 2006 - ieeexplore.ieee.org
Gaussian mixture models (GMMs) have proven extremely successful for text-independent
speaker recognition. The standard training method for GMM models is to use MAP …

[PDF] Language recognition via i-Vectors and dimensionality reduction.

N Dehak, PA Torres-Carrasquillo, DA Reynolds… - Interspeech, 2011 - isca-archive.org
In this paper, a new language identification system is presented based on the total variability
approach previously developed in the field of speaker identification. Various techniques are …

[BOOK] Speaker recognition

H Beigi - 2011 - Springer
The objective of the enrollment process is to modify (adapt) a speaker-independent model
into one that best characterizes the target speaker's vocal tract characteristics. Depending …

[PDF] Within-class covariance normalization for SVM-based speaker recognition.

AO Hatch, SS Kajarekar, A Stolcke - Interspeech, 2006 - sri.com
This paper extends the within-class covariance normalization (WCCN) technique described
in [1, 2] for training generalized linear kernels. We describe a practical procedure for …

An overview of statistical pattern recognition techniques for speaker verification

A Fazel, S Chakrabartty - IEEE Circuits and Systems Magazine, 2011 - ieeexplore.ieee.org
Even though the subject of speaker verification has been investigated for several decades,
numerous challenges and new opportunities in robust recognition techniques are still being …

Advances in channel compensation for SVM speaker recognition

A Solomonoff, WM Campbell… - … .(ICASSP'05). IEEE …, 2005 - ieeexplore.ieee.org
Cross-channel degradation is one of the significant challenges facing speaker recognition
systems. We study the problem for speaker recognition using support vector machines …

Automatic speaker, age-group and gender identification from children's speech

S Safavi, M Russell, P Jančovič - Computer Speech & Language, 2018 - Elsevier
A speech signal contains important paralinguistic information, such as the identity, age,
gender, language, accent, and the emotional state of the speaker. Automatic recognition of …

[PDF] Variational domain adversarial learning for speaker verification.

Y Tu, MW Mak, JT Chien - Interspeech, 2019 - isca-archive.org
Domain mismatch refers to the problem in which the distribution of training data
differs from that of the test data. This paper proposes a variational domain adversarial neural …

Measuring, refining and calibrating speaker and language information extracted from speech

N Brummer - 2010 - scholar.sun.ac.za
We propose a new methodology, based on proper scoring rules, for the evaluation of the
goodness of pattern recognizers with probabilistic outputs. The recognizers of interest take …