A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

[КНИГА][B] Machine learning for speaker recognition

MW Mak, JT Chien - 2020 - books.google.com
This book will help readers understand fundamental and advanced statistical models and
deep learning models for robust speaker recognition and domain adaptation. This useful …

Curriculum learning based approaches for noise robust speaker recognition

S Ranjan, JHL Hansen - IEEE/ACM Transactions on Audio …, 2017 - ieeexplore.ieee.org
Performance of speaker identification (SID) systems is known to degrade rapidly in the
presence of mismatch such as noise and channel degradations. This study introduces a …

The ibm speaker recognition system: Recent advances and error analysis

SO Sadjadi, J Pelecanos, S Ganapathy - arxiv preprint arxiv:1605.01635, 2016 - arxiv.org
We present the recent advances along with an error analysis of the IBM speaker recognition
system for conversational speech. Some of the key advancements that contribute to our …

Mixture of PLDA for noise robust i-vector speaker verification

MW Mak, X Pang, JT Chien - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
In real-world environments, noisy utterances with variable noise levels are recorded and
then converted to i-vectors for cosine distance or PLDA scoring. This paper investigates the …

The IBM 2016 speaker recognition system

SO Sadjadi, S Ganapathy, JW Pelecanos - arxiv preprint arxiv …, 2016 - arxiv.org
In this paper we describe the recent advancements made in the IBM i-vector speaker
recognition system for conversational speech. In particular, we identify key techniques that …

Local pairwise linear discriminant analysis for speaker verification

L He, X Chen, C Xu, J Liu… - IEEE Signal Processing …, 2018 - ieeexplore.ieee.org
Linear discriminant analysis—probabilistic linear discriminant analysis (LDA-PLDA) is a
standard and effective backend in the field of speaker verification. The object of LDA is to …

Modelling and compensation for language mismatch in speaker verification

A Misra, JHL Hansen - Speech Communication, 2018 - Elsevier
Abstract Language mismatch represents one of the more difficult challenges in achieving
effective speaker verification in naturalistic audio streams. The portion of bi-lingual speakers …

i-vector/PLDA speaker recognition using support vectors with discriminant analysis

F Bahmaninezhad, JHL Hansen - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
i-Vector feature representation with probabilistic linear discriminant analysis (PLDA) scoring
in speaker recognition system has recently achieved effective permanence even on channel …

Deep discriminant analysis for i-vector based robust speaker recognition

S Wang, Z Huang, Y Qian, K Yu - 2018 11th International …, 2018 - ieeexplore.ieee.org
Linear Discriminant Analysis (LDA) has been used as a standard post-processing procedure
in many state-of-the-art speaker recognition tasks. Through maximizing the inter-speaker …