Continuous speech recognition technologies—a review

S Bhatt, A Jain, A Dev - … Developments in Acoustics: Select Proceedings of …, 2021 - Springer
Speech recognition is the most emerging field of research, as speech is the natural way of
communication. This paper presents the different technologies used for continuous speech …

A smartphone-based ASR data collection tool for under-resourced languages

NJ De Vries, MH Davel, J Badenhorst, WD Basson… - Speech …, 2014 - Elsevier
Acoustic data collection for automatic speech recognition (ASR) purposes is a particularly
challenging task when working with under-resourced languages, many of which are found in …

The NCHLT speech corpus of the South African languages

E Barnard, MH Davel, C van Heerden, F De Wet… - 2014 - repository.nwu.ac.za
The NCHLT speech corpus contains wide-band speech from approximately 200 speakers
per language, in each of the eleven official languages of South Africa. We describe the …

[PDF][PDF] A systematic analysis of automatic speech recognition: an overview

T Gulzar, A Singh, DK Rajoriya… - International Journal of …, 2014 - academia.edu
Most high-flying and primary means of communication among humans is speech. Despite
the researches and developments in the field of automatic speech recognition the accuracy …

Feature extraction techniques with analysis of confusing words for speech recognition in the Hindi language

S Bhatt, A Jain, A Dev - Wireless Personal Communications, 2021 - Springer
The research work presents experimental work to build a speaker-independent connected
word Hindi speech recognition system using different feature extraction techniques with …

Using out-of-language data to improve an under-resourced speech recognizer

D Imseng, P Motlicek, H Bourlard, PN Garner - Speech communication, 2014 - Elsevier
Under-resourced speech recognizers may benefit from data in languages other than the
target language. In this paper, we report how to boost the performance of an Afrikaans …

Two sepedi-english code-switched speech corpora

TI Modipa, MH Davel - Language Resources and Evaluation, 2022 - Springer
We report on the development of two reference corpora for the analysis of Sepedi-English
code-switched speech in the context of automatic speech recognition. For the first corpus …

Comparison of grapheme-to-phoneme conversion methods on a myanmar pronunciation dictionary

YK Thu, WP Pa, Y Sagisaka… - Proceedings of the 6th …, 2016 - aclanthology.org
Abstract Grapheme-to-Phoneme (G2P) conversion is the task of predicting the pronunciation
of a word given its graphemic or written form. It is a highly important part of both automatic …

Grapheme-to-phoneme model generation for Indo-European languages

T Schlippe, S Ochs, T Schultz - 2012 IEEE International …, 2012 - ieeexplore.ieee.org
In this paper, we evaluate grapheme-to-phoneme (g2p) models among languages and of
different quality. We created g2p models for Indo-European languages with word …

Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition

D Imseng, P Motlicek, PN Garner… - 2013 IEEE Workshop …, 2013 - ieeexplore.ieee.org
Posterior based acoustic modeling techniques such as Kullback-Leibler divergence based
HMM (KL-HMM) and Tandem are able to exploit out-of-language data through posterior …