[BOOK][B] Introduction to Arabic natural language processing
NY Habash - 2010 - books.google.com
This book provides system developers and researchers in natural language processing and
computational linguistics with the necessary background information for working with the …
computational linguistics with the necessary background information for working with the …
Toward gender-inclusive coreference resolution
Correctly resolving textual mentions of people fundamentally entails making inferences
about those people. Such inferences raise the risk of systemic biases in coreference …
about those people. Such inferences raise the risk of systemic biases in coreference …
[PDF][PDF] MADA+ TOKAN: A toolkit for Arabic tokenization, diacritization, morphological disambiguation, POS tagging, stemming and lemmatization
We describe the MADA+ TOKAN toolkit, a versatile and freely available system that can
derive extensive morphological and contextual information from raw Arabic text, and then …
derive extensive morphological and contextual information from raw Arabic text, and then …
[PDF][PDF] Efficient higher-order CRFs for morphological tagging
Training higher-order conditional random fields is prohibitive for huge tag sets. We present
an approximated conditional random field using coarse-to-fine decoding and early updating …
an approximated conditional random field using coarse-to-fine decoding and early updating …
Toward gender-inclusive coreference resolution: An analysis of gender and bias throughout the machine learning lifecycle
Correctly resolving textual mentions of people fundamentally entails making inferences
about those people. Such inferences raise the risk of systematic biases in coreference …
about those people. Such inferences raise the risk of systematic biases in coreference …
Joint lemmatization and morphological tagging with lemming
We present LEMMING, a modular log-linear model that jointly models lemmatization and
tagging and supports the integration of arbitrary global features. It is trainable on corpora …
tagging and supports the integration of arbitrary global features. It is trainable on corpora …
[BOOK][B] Computational approaches to morphology and syntax
The book will appeal to scholars and advanced students of morphology, syntax,
computational linguistics and natural language processing (NLP). It provides a critical and …
computational linguistics and natural language processing (NLP). It provides a critical and …
Universal Lemmatizer: A sequence-to-sequence model for lemmatizing Universal Dependencies treebanks
In this paper, we present a novel lemmatization method based on a sequence-to-sequence
neural network architecture and morphosyntactic context representation. In the proposed …
neural network architecture and morphosyntactic context representation. In the proposed …
[PDF][PDF] Arabic diacritization through full morphological tagging
Proceedings of NAACL HLT 2007 Page 1 Proceedings of NAACL HLT 2007, Companion
Volume, pages 53–56, Rochester, NY, April 2007. cO2007 Association for Computational …
Volume, pages 53–56, Rochester, NY, April 2007. cO2007 Association for Computational …
[PDF][PDF] The best of two worlds: Cooperation of statistical and rule-based taggers for Czech
Several hybrid disambiguation methods are described which combine the strength of hand-
written disambiguation rules and statistical taggers. Three different statistical (HMM …
written disambiguation rules and statistical taggers. Three different statistical (HMM …