Text preprocessing for text mining in organizational research: Review and recommendations
Recent advances in text mining have provided new methods for capitalizing on the
voluminous natural language text data created by organizations, their employees, and their …
voluminous natural language text data created by organizations, their employees, and their …
Stylometry with R: a package for computational text analysis
This software paper describes 'Stylometry with R'(stylo), a flexible R package for the
highlevel analysis of writing style in stylometry. Stylometry (computational stylistics) is …
highlevel analysis of writing style in stylometry. Stylometry (computational stylistics) is …
Authorship attribution
P Juola - Foundations and Trends® in Information Retrieval, 2008 - nowpublishers.com
Authorship attribution, the science of inferring characteristics of the author from the
characteristics of documents written by that author, is a problem with a long history and a …
characteristics of documents written by that author, is a problem with a long history and a …
Computational methods in authorship attribution
Statistical authorship attribution has a long history, culminating in the use of modern
machine learning classification methods. Nevertheless, most of this work suffers from the …
machine learning classification methods. Nevertheless, most of this work suffers from the …
Big data analytics for security and criminal investigations
Applications of various data analytics technologies to security and criminal investigation
during the past three decades have demonstrated the inception, growth, and maturation of …
during the past three decades have demonstrated the inception, growth, and maturation of …
Authorship attribution in the wild
Most previous work on authorship attribution has focused on the case in which we need to
attribute an anonymous document to one of a small set of candidate authors. In this paper …
attribute an anonymous document to one of a small set of candidate authors. In this paper …
Overview of the cross-domain authorship verification task at PAN 2020
M Kestemont, E Manjavacas… - Working notes of …, 2020 - repository.uantwerpen.be
Authorship identification remains a highly topical research problem in computational text
analysis with many relevant applications in contemporary society and industry. For this …
analysis with many relevant applications in contemporary society and industry. For this …
Semstyle: Learning to generate stylised image captions using unaligned text
Linguistic style is an essential part of written communication, with the power to affect both
clarity and attractiveness. With recent advances in vision and language, we can start to …
clarity and attractiveness. With recent advances in vision and language, we can start to …
[PDF][PDF] Function words in authorship attribution. From black magic to theory?
M Kestemont - Proceedings of the 3rd Workshop on …, 2014 - aclanthology.org
This position paper focuses on the use of function words in computational authorship
attribution. Although recently there have been multiple successful applications of authorship …
attribution. Although recently there have been multiple successful applications of authorship …
Frequency in lexical processing
Background: Frequency of occurrence is a strong predictor of lexical processing across
modalities and experimental paradigms. However, frequency is part of a large set of …
modalities and experimental paradigms. However, frequency is part of a large set of …