Unsupervised learning of morphology

H Hammarström, L Borin - Computational Linguistics, 2011 - direct.mit.edu
This article surveys work on Unsupervised Learning of Morphology. We define
Unsupervised Learning of Morphology as the problem of inducing a description (of some …

Using different acoustic, lexical and language modeling units for ASR of an under-resourced language–Amharic

MY Tachbelie, ST Abate, L Besacier - Speech Communication, 2014 - Elsevier
State-of-the-art large vocabulary continuous speech recognition systems use mostly phone
based acoustic models (AMs) and word based lexical and language models. However …

[PDF][PDF] Growing an n-gram language model.

V Siivola, BL Pellom - Interspeech, 2005 - aaltodoc.aalto.fi
gram models are the most widely used language models in large vocabulary continuous
speech recognition. Since the size of the model grows rapidly with respect to the model …

Devanagari OCR using a recognition driven segmentation framework and stochastic language models

S Kompalli, S Setlur, V Govindaraju - International Journal on Document …, 2009 - Springer
This paper describes a novel recognition driven segmentation methodology for Devanagari
Optical Character Recognition. Prior approaches have used sequential rules to segment …

emLam--a Hungarian Language Modeling baseline

DM Nemeskey - arxiv preprint arxiv:1701.07880, 2017 - arxiv.org
This paper aims to make up for the lack of documented baselines for Hungarian language
modeling. Various approaches are evaluated on three publicly available Hungarian corpora …

[PDF][PDF] Morpheme-based automatic speech recognition for a morphologically rich language-Amharic.

MY Tachbelie, ST Abate, W Menzel - SLTU, 2010 - academia.edu
ABSTRACT Out-of-vocabulary (OOV) words are a major source of error in a speech
recognition system and various methods have been proposed to increase the performance …

Morpheme-based modeling of pronunciation variation for large vocabulary continuous speech recognition in Korean

KN Lee, M Chung - IEICE transactions on information and systems, 2007 - search.ieice.org
This paper describes a morpheme-based pronunciation model that is especially useful to
develop the pronunciation lexicon for Large Vocabulary Continuous Speech Recognition …

[IDÉZET][C] Natural language processing methods for language modeling

[PDF][PDF] Machine learning for the analysis of morphologically complex languages

SR Spiegler - 2011 - researchgate.net
This thesis demonstrates that machine learning can be applied in different ways to automate
the analysis of morphologically complex agglutinating languages. Firstly, the target …

Morpheme-based and factored language modeling for Amharic speech recognition

MY Tachbelie, ST Abate, W Menzel - Language and Technology …, 2009 - Springer
This paper presents the application of morpheme-based and factored language models in
an Amharic speech recognition task. Since the use of morphemes in both acoustic and …