Modeling pronunciation variation for ASR: A survey of the literature

H Strik, C Cucchiarini - Speech Communication, 1999 - Elsevier
The focus in automatic speech recognition (ASR) research has gradually shifted from
isolated words to conversational speech. Consequently, the amount of pronunciation …

Weighted finite-state transducers in speech recognition

M Mohri, F Pereira, M Riley - Computer Speech & Language, 2002 - Elsevier
We survey the use of weighted finite-state transducers (WFSTs) in speech recognition. We
show that WFSTs provide a common and natural representation for hidden Markov models …

Speech recognition with weighted finite-state transducers

M Mohri, F Pereira, M Riley - Springer Handbook of Speech Processing, 2008 - Springer
This chapter describes a general representation and algorithmic framework for speech
recognition based on weighted finite-state transducers. These transducers provide a …

Speaking in shorthand–A syllable-centric perspective for understanding pronunciation variation

S Greenberg - Speech Communication, 1999 - Elsevier
Current-generation automatic speech recognition (ASR) systems model spoken discourse
as a quasi-linear sequence of words and phones. Because it is unusual for every phone …

Recognizing speech of goats, wolves, sheep and… non-natives

D Van Compernolle - Speech Communication, 2001 - Elsevier
This paper reviews the current understanding of acoustic–phonetic issues and the problems
arising when trying to recognize speech from non-native speakers. Conceptually, regional …

Effects of speaking rate and word frequency on pronunciations in convertional speech

E Fosler-Lussier, N Morgan - Speech Communication, 1999 - Elsevier
Automatic speech recognition (ASR) systems typically have a static dictionary of word
pronunciations for matching acoustic models to words. In this work, we argue that, in fact …

Syllable-based large vocabulary continuous speech recognition

A Ganapathiraju, J Hamaker, J Picone… - … on speech and …, 2001 - ieeexplore.ieee.org
Most large vocabulary continuous speech recognition (LVCSR) systems in the past decade
have used a context-dependent (CD) phone as the fundamental acoustic unit. We present …

[PDF][PDF] Moving beyond the 'beads-on-a-string'model of speech

M Ostendorf - Proc. IEEE ASRU Workshop, 1999 - Citeseer
The notion that a word is composed of a sequence of phone segments, sometimes referred
to as 'beads on a string', has formed the basis of most speech recognition work for over 15 …

Artificial Intelligence is Awesome, but Good Teaching Should Always Come First.

J Crawford, C Vallis, J Yang, R Fitzgerald… - Journal of University …, 2023 - ro.uow.edu.au
The explosion of generative artificial intelligence into the mainstream of society some twelve
months ago has seriously challenged learning and teaching practice. Since then, AI …

Accent issues in large vocabulary continuous speech recognition

C Huang, T Chen, E Chang - International Journal of Speech Technology, 2004 - Springer
This paper addresses accent 1 issues in large vocabulary continuous speech recognition.
Cross-accent experiments show that the accent problem is very dominant in speech …