Text preprocessing for text mining in organizational research: Review and recommendations

L Hickman, S Thapa, L Tay, M Cao… - Organizational …, 2022 - journals.sagepub.com
Recent advances in text mining have provided new methods for capitalizing on the
voluminous natural language text data created by organizations, their employees, and their …

Stylometry with R: a package for computational text analysis

M Eder, J Rybicki, M Kestemont - The R Journal, 2016 - ruj.uj.edu.pl
This software paper describes 'Stylometry with R' (stylo), a flexible R package for the
high-level analysis of writing style in stylometry. Stylometry (computational stylistics) is …

Authorship attribution

P Juola - Foundations and Trends® in Information Retrieval, 2008 - nowpublishers.com
Authorship attribution, the science of inferring characteristics of the author from the
characteristics of documents written by that author, is a problem with a long history and a …

Computational methods in authorship attribution

M Koppel, J Schler, S Argamon - Journal of the American …, 2009 - Wiley Online Library
Statistical authorship attribution has a long history, culminating in the use of modern
machine learning classification methods. Nevertheless, most of this work suffers from the …

Big data analytics for security and criminal investigations

MI Pramanik, RYK Lau, WT Yue… - … reviews: data mining …, 2017 - Wiley Online Library
Applications of various data analytics technologies to security and criminal investigation
during the past three decades have demonstrated the inception, growth, and maturation of …

Authorship attribution in the wild

M Koppel, J Schler, S Argamon - Language Resources and Evaluation, 2011 - Springer
Most previous work on authorship attribution has focused on the case in which we need to
attribute an anonymous document to one of a small set of candidate authors. In this paper …

Overview of the cross-domain authorship verification task at PAN 2020

M Kestemont, E Manjavacas… - Working notes of …, 2020 - repository.uantwerpen.be
Authorship identification remains a highly topical research problem in computational text
analysis with many relevant applications in contemporary society and industry. For this …

Semstyle: Learning to generate stylised image captions using unaligned text

A Mathews, L Xie, X He - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Linguistic style is an essential part of written communication, with the power to affect both
clarity and attractiveness. With recent advances in vision and language, we can start to …

Function words in authorship attribution. From black magic to theory?

M Kestemont - Proceedings of the 3rd Workshop on …, 2014 - aclanthology.org
This position paper focuses on the use of function words in computational authorship
attribution. Although recently there have been multiple successful applications of authorship …

Frequency in lexical processing

RH Baayen, P Milin, M Ramscar - Aphasiology, 2016 - Taylor & Francis
Background: Frequency of occurrence is a strong predictor of lexical processing across
modalities and experimental paradigms. However, frequency is part of a large set of …