Beyond memorization: Violating privacy via inference with large language models

R Staab, M Vero, M Balunović, M Vechev - arXiv preprint arXiv …, 2023 - arxiv.org
Current privacy research on large language models (LLMs) primarily focuses on the issue of
extracting memorized training data. At the same time, models' inference capabilities have …

What demographic attributes do our digital footprints reveal? A systematic review

J Hinds, AN Joinson - PloS one, 2018 - journals.plos.org
To what extent does our online activity reveal who we are? Recent research has
demonstrated that the digital traces left by individuals as they browse and interact with …

A report on the first native language identification shared task

J Tetreault, D Blanchard, A Cahill - … on innovative use of NLP for …, 2013 - aclanthology.org
Abstract Native Language Identification, or NLI, is the task of automatically classifying the L1
of a writer based solely on his or her essay written in another language. This problem area …

Large scale personality classification of bloggers

F Iacobelli, AJ Gill, S Nowson, J Oberlander - Affective Computing and …, 2011 - Springer
Personality is a fundamental component of an individual's affective behavior. Previous work
on personality classification has emerged from disparate sources: Varieties of algorithms …

Twitter user profiling based on text and community mining for market analysis

K Ikeda, G Hattori, C Ono, H Asoh… - Knowledge-Based Systems, 2013 - Elsevier
This paper proposes demographic estimation algorithms for profiling Twitter users, based on
their tweets and community relationships. Many people post their opinions via social media …

A report on the 2017 native language identification shared task

S Malmasi, K Evanini, A Cahill, J Tetreault… - Proceedings of the …, 2017 - aclanthology.org
Abstract Native Language Identification (NLI) is the task of automatically identifying the
native language (L1) of an individual based on their language production in a learned …

Stylometric analysis of scientific articles

S Bergsma, M Post, D Yarowsky - … of the 2012 Conference of the …, 2012 - aclanthology.org
We present an approach to automatically recover hidden attributes of scientific articles, such
as whether the author is a native English speaker, whether the author is a male or a female …

Native language identification in texts: A survey

D Goswami, S Thilagan, K North… - Proceedings of the …, 2024 - aclanthology.org
We present the first comprehensive survey of Native Language Identification (NLI) applied to
texts. NLI is the task of automatically identifying an author's native language (L1) based on …

Improving native language identification with tf-idf weighting

BG Gebre, M Zampieri, P Wittenburg… - Proceedings of the …, 2013 - aclanthology.org
This paper presents a Native Language Identification (NLI) system based on TF-IDF
weighting schemes and using linear classifiers: support vector machines, logistic …

INAOE's Participation at PAN'15: Author Profiling task.

MÁ Álvarez-Carmona, AP López-Monroy… - CLEF (Working …, 2015 - downloads.webis.de
In this paper, we describe the participation of the Language Technologies Lab of INAOE at
PAN 2015. According to the Author Profiling (AP) literature. In this paper we take such …