Arabic text clustering using improved clustering algorithms with dimensionality reduction

AK Sangaiah, AE Fakhry, M Abdel-Basset… - Cluster Computing, 2019 - Springer
Arabic Text document clustering is an important aspect for providing conjectural navigation
and browsing techniques by organizing massive amounts of data into a small number of …

Hybrid intelligent technique for text categorization

AT Sadiq, SM Abdullah - 2012 international conference on …, 2012 - ieeexplore.ieee.org
Text categorization is the task in which documents are classified into one or more of
predefined categories based on their contents. This paper shows that the proposed system …

Document clustering using synthetic cluster prototypes

A Kalogeratos, A Likas - Data & Knowledge Engineering, 2011 - Elsevier
The use of centroids as prototypes for clustering text documents with the k-means family of
methods is not always the best choice for representing text clusters due to the high …

Application of Niblack's method on images

S Farid, F Ahmed - 2009 International Conference on Emerging …, 2009 - ieeexplore.ieee.org
Image segmentation is a major step in image analysis and processing. Segmentation is
performed through several methods. In this work Niblack's method of segmentation is further …

AutoDocSegmenter: A Geometric Approach towards Self-Supervised Document Segmentation

A Chatterjee, A Raj, S Dey, P Jawanpuria… - … on Machine Learning … - openreview.net
Document segmentation, the process of dividing a document into coherent and significant
regions, plays a crucial role for diverse applications that require parsing, retrieval, and …

A systematic approach to design of a text categorizer

RB Bradford, J Pozniak - 2016 IEEE International Conference …, 2016 - ieeexplore.ieee.org
In this paper, we implement a systematic approach to text categorization using latent
semantic indexing (LSI). A novel feature of our approach is that we iteratively refine the LSI …

Statistical computation and term weighting for feature extraction on Twitter

AI Kadhim - 2018 International Conference on Advance of …, 2018 - ieeexplore.ieee.org
The TF-IDF term weighting is one of the most successful methods in feature extraction.
Moreover, the highest term frequency (TF) for each document is concerned for feature …

Similarity threshold estimation for enhanced text document clustering

MA Hassan, YA Al-Lahham - icicel.org
Traditional text document clustering techniques have a threshold value estimation
challenge. In most cases, it is a human decision, and the value should be determined …

Area Extraction of beads in Membrane filter using Image Segmentation Techniques

N Taneja, S Goyal - Citeseer
This paper describes the methodology adopted to analyze the quality of Membrane filter.
The quality of Membrane filter is estimated by the uniformity of polymer beads which …

[CITATION] A comprehensive analysis of using wordnet, part-of-speech tagging, and word sense disambiguation in text categorization

K Celik - Unpublished Master's Thesis, Department of Computer …, 2012