Sub-lexical language models with word level pronunciation lexicons
H Sak, M Saraclar - US Patent 9,292,489, 2016 - Google Patents
An automatic speech recognition (ASR) system and method are provided for using sub-
lexical language models together with word level pronunciation lexicons. These approaches …
Lattice indexing for spoken term detection
This paper considers the problem of constructing an efficient inverted index for the spoken
term detection (STD) task. More specifically, we construct a deterministic weighted finite …
Morfessor 2.0: Toolkit for statistical morphological segmentation
Morfessor is a family of probabilistic machine learning methods for finding the morphological
segmentation from raw text data. Recent developments include the development of semi …
Multilingual speech recognition for Turkic languages
The primary aim of this study was to contribute to the development of multilingual automatic
speech recognition for lower-resourced Turkic languages. Ten languages—Azerbaijani …
Advances in subword-based HMM-DNN speech recognition across languages
We describe a novel way to implement subword language models in speech recognition
systems based on weighted finite state transducers, hidden Markov models, and deep …
Spoken content retrieval: A survey of techniques and technologies
Speech media, that is, digital audio and video containing spoken content, has blossomed in
recent years. Large collections are accruing on the Internet as well as in private and …
Improved subword modeling for WFST-based speech recognition
Because in agglutinative languages the number of observed word forms is very high,
subword units are often utilized in speech recognition. However, the proper use of subword …
Resources for Turkish natural language processing: A critical survey
This paper presents a comprehensive survey of corpora and lexical resources available for
Turkish. We review a broad range of resources, focusing on the ones that are publicly …
Alternative structures for character-level RNNs
Recurrent neural networks are convenient and efficient models for language modeling.
However, when applied on the level of characters instead of words, they suffer from several …
A detailed survey of Turkish automatic speech recognition
Significant improvements have been made in automatic speech recognition (ASR) systems
in terms of both the general technology and the software used. Despite these …