Automatic speech recognition using limited vocabulary: A survey

JLKE Fendji, DCM Tala, BO Yenke… - Applied Artificial …, 2022 - Taylor & Francis
ABSTRACT Automatic Speech Recognition (ASR) is an active field of research due to its
large number of applications and the proliferation of interfaces or computing devices that …

Speech production knowledge in automatic speech recognition

S King, J Frankel, K Livescu, E McDermott… - The Journal of the …, 2007 - pubs.aip.org
Although much is known about how speech is produced, and research into speech
production has resulted in measured articulatory data, feature systems of different kinds, and …

Effects of disfluencies, predictability, and utterance position on word form variation in English conversation

A Bell, D Jurafsky, E Fosler-Lussier, C Girand… - The Journal of the …, 2003 - pubs.aip.org
Function words, especially frequently occurring ones such as (the, that, and, and of), vary
widely in pronunciation. Understanding this variation is essential both for cognitive modeling …

Pliability rules

A Bell, G Parchomovsky - Mich. L. Rev., 2002 - HeinOnline
In 1543, the Polish astronomer, Nicolas Copernicus, determined the heliocentric design of
the solar system.'Copernicus was motivated in large part by the conviction that Claudius …

Graphical models and automatic speech recognition

JA Bilmes - Mathematical foundations of speech and language …, 2004 - Springer
Graphical models provide a promising paradigm to study both existing and novel techniques
for automatic speech recognition. This paper first provides a brief overview of graphical …

CTC regularized model adaptation for improving LSTM RNN based multi-accent mandarin speech recognition

J Yi, Z Wen, J Tao, H Ni, B Liu - Journal of Signal Processing Systems, 2018 - Springer
This paper proposes a novel regularized adaptation method to improve the performance of
multi-accent Mandarin speech recognition task. The acoustic model is based on long short …

[PDF][PDF] Prosody models for conversational speech recognition

M OstendorfÝ, I ShafranÞ, R Bates - 2003 - Citeseer
This paper describes a formal model for incorporating prosody in the speech recognition
process, both for improving word recognition directly and for jointly recognizing words and …

[PDF][PDF] Fundamental technologies in modern speech recognition

T OCKPH - IEEE Signal Processing Magazine, 2012 - Citeseer
There is a vast body of literature on LVCSR research and some limitation is necessary in the
scope of this article. We will focus primarily on the techniques that have been successful in …

What kind of pronunciation variation is hard for triphones to model?

D Jurafsky, W Ward, Z Ban**… - … , Speech, and Signal …, 2001 - ieeexplore.ieee.org
In order to help understand why gains in pronunciation modeling have proven so elusive,
we investigated which kinds of pronunciation variation are well captured by triphone models …

Pronunciation modeling for ASR–knowledge-based and data-derived methods

M Wester - Computer Speech & Language, 2003 - Elsevier
This paper focuses on modeling pronunciation variation in two different ways: data-derived
and knowledge-based. The knowledge-based approach consists of using phonological …