Text stemming: Approaches, applications, and challenges

J Singh, V Gupta - ACM Computing Surveys (CSUR), 2016 - dl.acm.org
Stemming is a process in which the variant word forms are mapped to their base form. It is
among the basic text pre-processing approaches used in Language Modeling, Natural …

A Systematic Review of Stemmers of Indian and Non-Indian Vernacular Languages

NR Dave, MA Mehta, K Kotecha - ACM Transactions on Asian and Low …, 2024 - dl.acm.org
The stemming process is crucial and significant in the pre-processing step of natural
language processing. The stemmer oversees the stemming process. It facilitates the …

Building a multilevel inflection handling stemmer to improve search effectiveness for Urdu Language

A Jabbar, S Iqba, A Alaulamie, M Ilahi - IEEE Access, 2024 - ieeexplore.ieee.org
Stemming is an essential step in various Natural Language Processing (NLP) applications
and is used to reduce different variants of the query words to a standard form to avoid the …

The rule-based sundanese stemmer

AA Suryani, DH Widyantoro, A Purwarianti… - ACM Transactions on …, 2018 - dl.acm.org
Our research proposed an iterative Sundanese stemmer by removing the derivational affixes
prior to the inflexional. This scheme was chosen because, in the Sundanese affixation, a …

KreolStem: A hybrid language-dependent stemmer for Kreol Morisien

B Gobin-Rahimbux, I Maudhoo… - Journal of Experimental …, 2024 - Taylor & Francis
Stemming is a technique used to transform words to their root forms. It is used in various
Natural Language Processing applications to improve performance and accuracy. In this …

[PDF][PDF] Comparative study of truncating and statistical stemming algorithms

S Memon, GA Mallah, KN Memon… - International Journal of …, 2020 - academia.edu
Search and indexing systems bear a significant quality called word stemming, is lump of
content excavating requests, IR frameworks and natural language handling frameworks. The …

An Extended Pattern Based Comprehensive Stemmer for the Urdu Language

M Ali, A Baqir, HH Raza Sherazi, S Khalid… - ACM Transactions on …, 2024 - dl.acm.org
The Urdu language is used by approximately 200 million people for spoken and written
communications on a daily basis. There is a substantial amount of unstructured Urdu textual …

Effect of Stopwords and Stemming Techniques in Urdu IR

SS Sahu, D Dutta, S Pal, I Rasheed - SN Computer Science, 2023 - Springer
This paper explores and evaluates the effect of different stopword removal and stemming
techniques in Urdu IR. The issues are examined from four viewpoints. Is there any …

LALITHA: A light weight Malayalam stemmer using suffix strip** method

U Prajitha, C Sreejith, PCR Raj - … International Conference on …, 2013 - ieeexplore.ieee.org
Stemming is the process of removing the affixes from inflections and to return the root form.
Malayalam is highly agglutinative in nature and hundreds of inflections are possible for each …

[PDF][PDF] Sindhi stemmer using affix removal method

AA Sattar, S Abbasi, MU Rahman, A Baig… - International …, 2021 - academia.edu
Stemming is the process of map** various inflections of a word to its base form. Stemmer
is an essential component of Information Retrieval (IR) systems and different Natural …