Unsupervised learning of morphology

H Hammarström, L Borin - Computational Linguistics, 2011 - direct.mit.edu
This article surveys work on Unsupervised Learning of Morphology. We define
Unsupervised Learning of Morphology as the problem of inducing a description (of some …

Unsupervised morphological segmentation with log-linear models

H Poon, C Cherry, K Toutanova - … of the North American Chapter of …, 2009 - aclanthology.org
Morphological segmentation breaks words into morphemes (the basic semantic units). It is a
key component for natural language processing systems. Unsupervised morphological …

Structures and distributions in morphology learning

E Chan - 2008 - search.proquest.com
One of the great challenges in linguistics and cognitive science is to understand the nature
of the mental representation of language. The precise mechanisms of the mind are …

Challenging language-dependent segmentation for Arabic: An application to machine translation and part-of-speech tagging

H Sajjad, F Dalvi, N Durrani, A Abdelali… - arXiv preprint arXiv …, 2017 - arxiv.org
Word segmentation plays a pivotal role in improving any Arabic NLP application. Therefore,
a lot of research has been spent in improving its accuracy. Off-the-shelf tools, however, are …

ParaMor: Finding paradigms across morphology

C Monson, J Carbonell, A Lavie, L Levin - Advances in Multilingual and …, 2008 - Springer
ParaMor automatically learns morphological paradigms from unlabelled text, and uses them
to annotate word forms with morpheme boundaries. ParaMor competed in the English and …

Computational modeling of agglutinative languages: the challenge for Southern Bantu languages

F Kambarami, S McLachlan, B Bozic… - Arusha Work. Pap. Afr …, 2021 - academia.edu
In computational linguistics, language models are probabilistic models that predict the
likelihood of words occurring within specific sentences. They are key components of many …

Morphological processing of compounds for statistical machine translation

F Cap - 2014 - elib.uni-stuttgart.de
Machine Translation denotes the translation of a text written in one language into another
language performed by a computer program. In times of internet and globalisation, there has …

Phonological constraints and morphological preprocessing for grapheme-to-phoneme conversion

V Demberg, H Schmid, G Möhler - … of the 45th Annual Meeting of …, 2007 - aclanthology.org
Grapheme-to-phoneme conversion (g2p) is a core component of any text-to-speech system.
We show that adding simple syllabification and stress assignment constraints, namely 'one …

Non-canonical inflection: data, formalisation and complexity measures

B Sagot, G Walther - … : Second International Workshop, SFCM 2011, Zurich …, 2011 - Springer
Non-canonical inflection (suppletion, deponency, heteroclisis, etc.) is extensively studied in
theoretical approaches to morphology. However, these studies often lack practical …

Large corpora for Turkic languages and unsupervised morphological analysis

V Baisa, V Suchomel - Proceedings of the Eighth conference on …, 2012 - nlp.fi.muni.cz
In this article we describe six new web corpora for Turkish, Azerbaijani, Kazakh, Turkmen,
Kyrgyz and Uzbek languages. The data for these corpora was automatically crawled from …