Surveying stylometry techniques and applications

T Neal, K Sundararajan, A Fatima, Y Yan… - ACM Computing …, 2017 - dl.acm.org
The analysis of authorial style, termed stylometry, assumes that style is quantifiably
measurable for evaluation of distinctive qualities. Stylometry research has yielded several …

Authorship attribution for social media forensics

A Rocha, WJ Scheirer, CW Forstall… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
The veil of anonymity provided by smartphones with pre-paid SIM cards, public Wi-Fi
hotspots, and distributed networks like Tor has drastically complicated the task of identifying …

Efficient English text classification using selected machine learning techniques

X Luo - Alexandria Engineering Journal, 2021 - Elsevier
Text classification (TC) is an approach used for the classification of any kind of documents
for the target category or out. In this paper, we implemented the Support Vector Machines …

Bayesian data analysis for newcomers

JK Kruschke, TM Liddell - Psychonomic bulletin & review, 2018 - Springer
This article explains the foundational concepts of Bayesian data analysis using virtually no
mathematical notation. Bayesian ideas already match your intuitions from everyday …

A survey of modern authorship attribution methods

E Stamatatos - Journal of the American Society for information …, 2009 - Wiley Online Library
Authorship attribution supported by statistical or computational methods has a long history
starting from the 19th century and is marked by the seminal study of Mosteller and Wallace …

Computational methods in authorship attribution

M Koppel, J Schler, S Argamon - Journal of the American …, 2009 - Wiley Online Library
Statistical authorship attribution has a long history, culminating in the use of modern
machine learning classification methods. Nevertheless, most of this work suffers from the …

Spam filtering with Naive Bayes - which Naive Bayes?

V Metsis, I Androutsopoulos, G Paliouras - CEAS, 2006 - nlp.cs.aueb.gr
Naive Bayes is very popular in commercial and open-source anti-spam e-mail filters. There
are, however, several forms of Naive Bayes, something the anti-spam literature does not …

Wikipedia-based semantic interpretation for natural language processing

E Gabrilovich, S Markovitch - Journal of Artificial Intelligence Research, 2009 - jair.org
Adequate representation of natural language semantics requires access to vast amounts of
common sense and domain-specific world knowledge. Prior work in the field was based on …

Automatic dimensionality selection from the scree plot via the use of profile likelihood

M Zhu, A Ghodsi - Computational Statistics & Data Analysis, 2006 - Elsevier
Most dimension reduction techniques produce ordered coordinates so that only the first few
coordinates need be considered in subsequent analyses. The choice of how many …

Authorship verification as a one-class classification problem

M Koppel, J Schler - Proceedings of the twenty-first international …, 2004 - dl.acm.org
In the authorship verification problem, we are given examples of the writing of a single
author and are asked to determine if given long texts were or were not written by this author …