Automatic language identification in texts: A survey
Language identification ("LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …
[BOOK] Natural language processing for social media
A Farzindar, D Inkpen, G Hirst - 2015 - Springer
In recent years, online social networking has revolutionized interpersonal communication.
The newer research on language analysis in social media has been increasingly focusing …
Extraction of emotions from multilingual text using intelligent text processing and computational linguistics
Extraction of emotions from multilingual text posted on social media by different
categories of users is one of the crucial tasks in the field of opinion mining and sentiment …
The importance of the language for the evolution of online communities: An analysis based on Twitter and Reddit
The study of Online Social Networks offers growing opportunities to examine a
number of aspects of the real world and to better understand how human society works at …
Twisty: a multilingual Twitter stylometry corpus for gender and personality profiling
Personality profiling is the task of detecting personality traits of authors based on writing
style. Several personality typologies exist; however, the Myers-Briggs Type Indicator (MBTI) is …
Identifying languages at the word level in code-mixed Indian social media text
Language identification at the document level has been considered an almost solved
problem in some application areas, but language detectors fail in the social media context …
Incorporating dialectal variability for socially equitable language identification
Language identification (LID) is a critical first step for processing multilingual text.
Yet most LID systems are not designed to handle the linguistic diversity of global platforms …
Citizen-centric urban planning through extracting emotion information from Twitter in an interdisciplinary space-time-linguistics algorithm
Traditional urban planning processes typically happen in offices and behind desks. Modern
types of civic participation can enhance those processes by acquiring citizens' ideas and …
The growing amplification of social media: Measuring temporal and social contagion dynamics for over 150 languages on Twitter for 2009–2020
Working from a dataset of 118 billion messages running from the start of 2009 to the end of
2019, we identify and explore the relative daily use of over 150 languages on Twitter. We …
Comparing the level of code-switching in corpora
Social media texts are often fairly informal and conversational, and when produced by
bilinguals tend to be written in several different languages simultaneously, in the same way …