A mixed method lemmatization algorithm using a hierarchy of linguistic identities (HOLI)

AK Ingason, S Helgadóttir, H Loftsson… - Advances in natural …, 2008 - Springer
We present a new mixed method lemmatizer for Icelandic, Lemmald, which achieves good
performance by relying on IceTagger [1] for tagging and The Icelandic Frequency Dictionary …

Nefnir: A high accuracy lemmatizer for Icelandic

SL Ingólfsdóttir, H Loftsson, JF Daðason… - ar** a PoS-tagged corpus using existing tools
H Loftsson, JH Yngvason, S Helgadóttir… - … SaLTMiL Workshop on …, 2010 - academia.edu
In this paper, we describe the development of a new tagged corpus of Icelandic, consisting
of about 1 million tokens. The goal is to use the corpus, among other things, as a new gold …

Comparing rule-based and SMT-based spelling normalisation for English historical texts

G Schneider, E Pettersson, M Percillier - 2017 - zora.uzh.ch
To be able to use existing natural language processing tools for analysing historical text, an
important preprocessing step is spelling normalisation, converting the original spelling to …

[PDF][PDF] Almannaromur: An open icelandic speech corpus

J Guðnason, O Kjartansson, J Jóhannsson… - … for Under-Resourced …, 2012 - isca-archive.org
The purpose of the Almannarómur project is collecting data for a speech corpus (database)
for Icelandic. Its main aim is creating an open source speech project to enable research and …

[PDF][PDF] Context-sensitive spelling correction and rich morphology

AK Ingason, SB Jóhannsson… - Proceedings of the …, 2009 - aclanthology.org
Context-sensitive spelling correction is the task of correcting spelling errors which result in
valid words. We present work in progress where we adapt established methods from English …

[PDF][PDF] Waste not, want not: Towards a system architecture for ICALL based on NLP component re-use

E Volodina, L Borin, H Loftsson… - Proceedings of the …, 2012 - academia.edu
It is a surprising fact that, despite the existence of various mature Natural Language
Processing (NLP) tools and resources that can potentially benefit language learning, very …