Automatic language identification using deep neural networks

I Lopez-Moreno, J Gonzalez-Dominguez… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org
This work studies the use of deep neural networks (DNNs) to address automatic language
identification (LID). Motivated by their recent success in acoustic modelling, we adapt DNNs …

[PDF][PDF] Automatic language identification using long short-term memory recurrent neural networks.

J Gonzalez-Dominguez, I Lopez-Moreno, H Sak… - Interspeech, 2014 - isca-archive.org
This work explores the use of Long Short-Term Memory (LSTM) recurrent neural networks
(RNNs) for automatic language identification (LID). The use of RNNs is motivated by their …

Frame-by-frame language identification in short utterances using deep neural networks

J Gonzalez-Dominguez, I Lopez-Moreno, PJ Moreno… - Neural Networks, 2015 - Elsevier
This work addresses the use of deep neural networks (DNNs) in automatic language
identification (LID) focused on short test utterances. Motivated by their recent success in …

i-Vectors in speech processing applications: a survey

P Verma, PK Das - International Journal of Speech Technology, 2015 - Springer
In the domain of speech recognition many methods have been proposed over time like
Gaussian mixture models (GMM), GMM with universal background model (GMM-UBM …

A pre-classification-based language identification for Northeast Indian languages using prosody and spectral features

C China Bhanja, MA Laskar, RH Laskar - Circuits, Systems, and Signal …, 2019 - Springer
This paper is aimed at develo** a two-stage language identification (LID) system for
Northeast Indian languages. In the first stage, languages are pre-classified into tonal and …

Improving automated scoring of prosody in oral reading fluency using deep learning algorithm

K Wang, X Qiao, G Sammit, EC Larson, J Nese… - Frontiers in …, 2024 - frontiersin.org
Automated assessing prosody of oral reading fluency presents challenges due to the
inherent difficulty of quantifying prosody. This study proposed and evaluated an approach …

[LIVRE][B] Neural models for integrating prosody in spoken language understanding

T Tran - 2020 - search.proquest.com
Prosody comprises aspects of speech that communicate information beyond written words
related to syntax, sentiment, intent, discourse, and comprehension. Decades of research …

[HTML][HTML] Deep neural network based two-stage Indian language identification system using glottal closure instants as anchor points

CC Bhanja, MA Laskar, RH Laskar… - Journal of King Saud …, 2022 - Elsevier
This paper presents a two-stage Indian language identification (TS-LID) system which is
made up of a tonal/non-tonal pre-classification and individual language identification …

Turkish dialect recognition in terms of prosodic by long short-term memory neural networks

G IŞIK, H Artuner - Journal of the Faculty of Engineering …, 2020 - avesis.hacettepe.edu.tr
Dialects are forms of speech, separated from languages which they belong to in terms of
some characteristics and which are specific to a certain region of the country. Obtaining …

Tied hidden factors in neural networks for end-to-end speaker recognition

A Miguel, J Llombart, A Ortega, E Lleida - arxiv preprint arxiv:1812.11946, 2018 - arxiv.org
In this paper we propose a method to model speaker and session variability and able to
generate likelihood ratios using neural networks in an end-to-end phrase dependent …