[BOOK] Introduction to Arabic natural language processing

NY Habash - 2010 - books.google.com
This book provides system developers and researchers in natural language processing and
computational linguistics with the necessary background information for working with the …

Toward gender-inclusive coreference resolution

YT Cao, H Daumé III - arXiv preprint arXiv:1910.13913, 2019 - arxiv.org
Correctly resolving textual mentions of people fundamentally entails making inferences
about those people. Such inferences raise the risk of systemic biases in coreference …

MADA+ TOKAN: A toolkit for Arabic tokenization, diacritization, morphological disambiguation, POS tagging, stemming and lemmatization

N Habash, O Rambow, R Roth - Proceedings of the 2nd …, 2009 - researchgate.net
We describe the MADA+ TOKAN toolkit, a versatile and freely available system that can
derive extensive morphological and contextual information from raw Arabic text, and then …

Efficient higher-order CRFs for morphological tagging

T Müller, H Schmid, H Schütze - Proceedings of the 2013 …, 2013 - aclanthology.org
Training higher-order conditional random fields is prohibitive for huge tag sets. We present
an approximated conditional random field using coarse-to-fine decoding and early updating …

Toward gender-inclusive coreference resolution: An analysis of gender and bias throughout the machine learning lifecycle

YT Cao, H Daumé III - Computational Linguistics, 2021 - aclanthology.org
Correctly resolving textual mentions of people fundamentally entails making inferences
about those people. Such inferences raise the risk of systematic biases in coreference …

Joint lemmatization and morphological tagging with Lemming

T Müller, R Cotterell, A Fraser, H Schütze - arXiv preprint arXiv …, 2024 - arxiv.org
We present LEMMING, a modular log-linear model that jointly models lemmatization and
tagging and supports the integration of arbitrary global features. It is trainable on corpora …

[BOOK] Computational approaches to morphology and syntax

B Roark, R Sproat - 2007 - books.google.com
The book will appeal to scholars and advanced students of morphology, syntax,
computational linguistics and natural language processing (NLP). It provides a critical and …

Universal Lemmatizer: A sequence-to-sequence model for lemmatizing Universal Dependencies treebanks

J Kanerva, F Ginter, T Salakoski - Natural Language Engineering, 2021 - cambridge.org
In this paper, we present a novel lemmatization method based on a sequence-to-sequence
neural network architecture and morphosyntactic context representation. In the proposed …

Arabic diacritization through full morphological tagging

N Habash, O Rambow - … 2007: The Conference of the North …, 2007 - aclanthology.org
Proceedings of NAACL HLT 2007, Companion Volume, pages 53–56, Rochester, NY, April 2007.
©2007 Association for Computational …

The best of two worlds: Cooperation of statistical and rule-based taggers for Czech

J Hajič, J Votrubec, P Krbec, P Květoň - Proceedings of the …, 2007 - aclanthology.org
Several hybrid disambiguation methods are described which combine the strength of hand-
written disambiguation rules and statistical taggers. Three different statistical (HMM …