What demographic attributes do our digital footprints reveal? A systematic review

J Hinds, AN Joinson - PloS one, 2018 - journals.plos.org
To what extent does our online activity reveal who we are? Recent research has
demonstrated that the digital traces left by individuals as they browse and interact with …

Automatic language identification in texts: A survey

T Jauhiainen, M Lui, M Zampieri, T Baldwin… - Journal of Artificial …, 2019 - jair.org
Language identification (" LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …

[PDF][PDF] Overview of the 5th author profiling task at pan 2017: Gender and language variety identification in twitter

F Rangel, P Rosso, M Potthast… - Working notes papers of …, 2017 - downloads.webis.de
This overview presents the framework and the results of the Author Profiling task at PAN
2017. The objective of this year is to address gender and language variety identification. For …

An automated text categorization framework based on hyperparameter optimization

ES Tellez, D Moctezuma, S Miranda-Jiménez… - Knowledge-Based …, 2018 - Elsevier
A great variety of text tasks such as topic or spam identification, user profiling, and sentiment
analysis can be posed as a supervised learning problem and tackled using a text classifier …

[PDF][PDF] Text and image synergy with feature cross technique for gender identification

T Takahashi, T Tahara, K Nagatani… - … Notes Papers of …, 2018 - pdfs.semanticscholar.org
Text and Image Synergy with Feature Cross Technique for Gender Identification Page 1 Text
and Image Synergy with Feature Cross Technique for Gender Identification CLEF/PAN 2018 …

Novel semantic and statistic features-based author profiling approach

S Ouni, F Fkih, MN Omri - Journal of Ambient Intelligence and Humanized …, 2023 - Springer
Abstract The Author Profiling (AP) task aims to predict certain demographic (eg, age,
gender) about authors from their documents. AP on social media networks is gaining …

Fine-grained analysis of language varieties and demographics

F Rangel, P Rosso, W Zaghouani… - Natural Language …, 2020 - cambridge.org
The rise of social media empowers people to interact and communicate with anyone
anywhere in the world. The possibility of being anonymous avoids censorship and enables …

Automatic Arabic dialect identification systems for written texts: A survey

MJ Althobaiti - arxiv preprint arxiv:2009.12622, 2020 - arxiv.org
Arabic dialect identification is a specific task of natural language processing, aiming to
automatically predict the Arabic dialect of a given text. Arabic dialect identification is the first …

Simply the best: minimalist system trumps complex models in author profiling

A Basile, G Dwyer, M Medvedeva, J Rawee… - … Conference of the Cross …, 2018 - Springer
A simple linear SVM with word and character n-gram features and minimal parameter tuning
can identify the gender and the language variety (for English, Spanish, Arabic and …

[PDF][PDF] Gender identification through multi-modal tweet analysis using microtc and bag of visual words

ES Tellez, S Miranda-Jiménez, D Moctezuma… - Proceedings of the …, 2018 - ceur-ws.org
This manuscript describes our solution to solve the Author Profiling task at PAN'18. In this
edition, the task asks for identifying the user's gender using both their Tweets containing …