Modeling pronunciation variation for ASR: A survey of the literature
H Strik, C Cucchiarini - Speech Communication, 1999 - Elsevier
The focus in automatic speech recognition (ASR) research has gradually shifted from
isolated words to conversational speech. Consequently, the amount of pronunciation …
Weighted finite-state transducers in speech recognition
We survey the use of weighted finite-state transducers (WFSTs) in speech recognition. We
show that WFSTs provide a common and natural representation for hidden Markov models …
Speech recognition with weighted finite-state transducers
This chapter describes a general representation and algorithmic framework for speech
recognition based on weighted finite-state transducers. These transducers provide a …
Speaking in shorthand–A syllable-centric perspective for understanding pronunciation variation
S Greenberg - Speech Communication, 1999 - Elsevier
Current-generation automatic speech recognition (ASR) systems model spoken discourse
as a quasi-linear sequence of words and phones. Because it is unusual for every phone …
Recognizing speech of goats, wolves, sheep and… non-natives
D Van Compernolle - Speech Communication, 2001 - Elsevier
This paper reviews the current understanding of acoustic–phonetic issues and the problems
arising when trying to recognize speech from non-native speakers. Conceptually, regional …
Effects of speaking rate and word frequency on pronunciations in conversational speech
Automatic speech recognition (ASR) systems typically have a static dictionary of word
pronunciations for matching acoustic models to words. In this work, we argue that, in fact …
Syllable-based large vocabulary continuous speech recognition
Most large vocabulary continuous speech recognition (LVCSR) systems in the past decade
have used a context-dependent (CD) phone as the fundamental acoustic unit. We present …
Moving beyond the 'beads-on-a-string' model of speech
M Ostendorf - Proc. IEEE ASRU Workshop, 1999 - Citeseer
The notion that a word is composed of a sequence of phone segments, sometimes referred
to as 'beads on a string', has formed the basis of most speech recognition work for over 15 …
Artificial Intelligence is Awesome, but Good Teaching Should Always Come First.
The explosion of generative artificial intelligence into the mainstream of society some twelve
months ago has seriously challenged learning and teaching practice. Since then, AI …
Accent issues in large vocabulary continuous speech recognition
This paper addresses accent issues in large vocabulary continuous speech recognition.
Cross-accent experiments show that the accent problem is very dominant in speech …