Arabic information retrieval: Stemming or lemmatization?

I Zeroual, A Lakhouaja - 2017 Intelligent Systems and …, 2017 - ieeexplore.ieee.org
The Arabic language is expanding in the world. According to UNESCO, the Arabic language
is spoken by more than 422 million native speakers around 29 countries and among 1.6 …

Improving Arabic lemmatization through a lemmas database and a machine-learning technique

D Namly, K Bouzoubaa, A El Jihad… - Recent Advances in NLP …, 2020 - Springer
Lemmatization is a key preprocessing step and an important component for many natural
language applications. For Arabic language, lemmatization is a complex task due to Arabic …

Introduction to the special issue on African Language Technology

G De Pauw, GM De Schryver, L Pretorius… - Language Resources and …, 2011 - Springer
In today's digital multilingual world, language technology is crucial for providing access to
information and opportunities for economic development. With approximately two thousand …

The power of language music: Arabic lemmatization through patterns

M Attia, A Zirikly, M Diab - Proceedings of the 5th Workshop on …, 2016 - aclanthology.org
The interaction between roots and patterns in Arabic has intrigued lexicographers and
morphologists for centuries. While roots provide the consonantal building blocks, patterns …

Setswana Verb Analyzer and Generator

G Malema, N Motlogelwa, B Okgetheng… - International Journal of …, 2016 - academia.edu
Morphological analysis is one of the first steps in natural language studies. It is a basic
component in a number of natural language processing systems. There are a few attempts …

Computational syntactic analysis of Setswana

AS Berg - 2018 - repository.nwu.ac.za
The main aim of this study is the computational syntactic analysis of the Setswana simple
sentence, using Lexical Functional Grammar (LFG) as framework and XLE as the associated …

Translation technology in South Africa

GB van Huyssteen, M Puttkammer… - Routledge …, 2023 - taylorfrancis.com
This chapter focuses on the history and state of the art of MT research and development in
South Africa for South African languages. It first provides an overview of the lead-up to MT …

Setswana Noun Analyzer and Generator

G Malema, M Motlhanka, B Okgetheng… - International Journal of …, 2018 - academia.edu
Word morphology is a process of analysing word formation. Morphological analysis is one of
the pre-processing steps in natural language processing tasks. Few studies have looked at …

and Marissa Griesel

GB van Huyssteen, M Puttkammer… - Routledge …, 2023 - books.google.com
South Africa has a rich and diverse multilingual culture with eleven official languages, two
Germanic languages (English and Afrikaans), four Nguni languages (isiNdebele [Ndebele] …

Morphological segmentation of isiXhosa using unsupervised machine learning

L Mzamo - 2021 - repository.nwu.ac.za
In this work the use of unsupervised machine learning in the morphological segmentation of
Nguni languages, evaluated on isiXhosa, is advanced. The work researches, extends …