Study of senone-based deep neural network approaches for spoken language recognition

L Ferrer, Y Lei, M McLaren… - IEEE/ACM Transactions …, 2015 - ieeexplore.ieee.org
This paper compares different approaches for using deep neural networks (DNNs) trained to
predict senone posteriors for the task of spoken language recognition (SLR). These …

[PDF][PDF] Application of Convolutional Neural Networks to Language Identification in Noisy Conditions.

Y Lei, L Ferrer, A Lawson, M McLaren, N Scheffer - Odyssey, 2014 - isca-archive.org
This paper proposes two novel frontends for robust language identification (LID) using a
convolutional neural network (CNN) trained for automatic speech recognition (ASR). In the …

[PDF][PDF] An end-to-end approach to language identification in short utterances using convolutional neural networks.

A Lozano-Diez, R Zazo-Candil… - Interspeech, 2015 - isca-archive.org
In this work, we propose an end-to-end approach to the language identification (LID)
problem based on Convolutional Deep Neural Networks (CDNNs). The use of CDNNs is …

Exploring the role of phonetic bottleneck features for speaker and language recognition

M McLaren, L Ferrer, A Lawson - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Using bottleneck features extracted from a deep neural network (DNN) trained to predict
senone posteriors has resulted in new, state-of-the-art technology for language and speaker …

On the use of phone log-likelihood ratios as features in spoken language recognition

M Diez, A Varona, M Penagarikano… - 2012 IEEE Spoken …, 2012 - ieeexplore.ieee.org
This paper presents an alternative feature set to the traditional MFCC-SDC used in acoustic
approaches to Spoken Language Recognition: the log-likelihood ratios of phone posterior …

Joint information from nonlinear and linear features for spoofing detection: An i-vector/DNN based approach

C Zhang, S Ranjan, MK Nandwana… - … , Speech and Signal …, 2016 - ieeexplore.ieee.org
Sustaining automatic speaker verification (ASV) systems from spoofing attacks remains an
essential challenge, even if significant progress in ASV has been achieved in recent years …

Language recognition using deep neural networks with very limited training data

S Ranjan, C Yu, C Zhang, F Kelly… - … on Acoustics, Speech …, 2016 - ieeexplore.ieee.org
This study proposes a novel deep neural network (DNN) based approach to language
identification (LID) for the NIST 2015 Language Recognition (LRE) i-Vector Machine …

[PDF][PDF] Adaptive gaussian backend for robust language identification.

M McLaren, A Lawson, Y Lei, N Scheffer - Interspeech, 2013 - sri.com
This paper proposes adaptive Gaussian backend (AGB), a novel approach to robust
language identification (LID). In this approach, a given test sample is compared to language …

[PDF][PDF] Improving language identification robustness to highly channel-degraded speech through multiple system fusion.

A Lawson, M McLaren, Y Lei, V Mitra, N Scheffer… - …, 2013 - isca-archive.org
We describe a language identification system developed for robustess to noise conditions
such as those encountered under the DARPA RATS program, which is focused on multi …

[PDF][PDF] Robust language recognition based on diverse features

Q Zhang, G Liu, JHL Hansen - ODYSSEY: The speaker and …, 2014 - researchgate.net
In real scenarios, robust language identification (LID) is usually hindered by factors such as
background noise, channel, and speech duration mismatches. To address these issues, this …