A mixed method lemmatization algorithm using a hierarchy of linguistic identities (HOLI)
We present a new mixed method lemmatizer for Icelandic, Lemmald, which achieves good
performance by relying on IceTagger [1] for tagging and The Icelandic Frequency Dictionary …
performance by relying on IceTagger [1] for tagging and The Icelandic Frequency Dictionary …
Nefnir: A high accuracy lemmatizer for Icelandic
SL Ingólfsdóttir, H Loftsson, JF Daðason… - ar** a PoS-tagged corpus using existing tools
In this paper, we describe the development of a new tagged corpus of Icelandic, consisting
of about 1 million tokens. The goal is to use the corpus, among other things, as a new gold …
of about 1 million tokens. The goal is to use the corpus, among other things, as a new gold …
Comparing rule-based and SMT-based spelling normalisation for English historical texts
To be able to use existing natural language processing tools for analysing historical text, an
important preprocessing step is spelling normalisation, converting the original spelling to …
important preprocessing step is spelling normalisation, converting the original spelling to …
[PDF][PDF] Almannaromur: An open icelandic speech corpus
The purpose of the Almannarómur project is collecting data for a speech corpus (database)
for Icelandic. Its main aim is creating an open source speech project to enable research and …
for Icelandic. Its main aim is creating an open source speech project to enable research and …
[PDF][PDF] Context-sensitive spelling correction and rich morphology
AK Ingason, SB Jóhannsson… - Proceedings of the …, 2009 - aclanthology.org
Context-sensitive spelling correction is the task of correcting spelling errors which result in
valid words. We present work in progress where we adapt established methods from English …
valid words. We present work in progress where we adapt established methods from English …
[PDF][PDF] Waste not, want not: Towards a system architecture for ICALL based on NLP component re-use
It is a surprising fact that, despite the existence of various mature Natural Language
Processing (NLP) tools and resources that can potentially benefit language learning, very …
Processing (NLP) tools and resources that can potentially benefit language learning, very …