Short text topic modeling techniques, applications, and performance: a survey
Analyzing short texts infers discriminative and coherent latent topics that is a critical and
fundamental task since many real-world applications require semantic understanding of …
fundamental task since many real-world applications require semantic understanding of …
Evaluation of clustering and topic modeling methods over health-related tweets and emails
Background Internet provides different tools for communicating with patients, such as social
media (eg, Twitter) and email platforms. These platforms provided new data sources to shed …
media (eg, Twitter) and email platforms. These platforms provided new data sources to shed …
Concept Drift Adaptation in Text Stream Mining Settings: A Comprehensive Review
Due to the advent and increase in the popularity of the Internet, people have been producing
and disseminating textual data in several ways, such as reviews, social media posts, and …
and disseminating textual data in several ways, such as reviews, social media posts, and …
Zero-shot micro-video classification with neural variational inference in graph prototype network
Micro-video classification plays a central role in online content recommendation platforms,
such as Kwai and Tik-Tok. Existing works on video classification largely exploit the …
such as Kwai and Tik-Tok. Existing works on video classification largely exploit the …
A Dirichlet process biterm-based mixture model for short text stream clustering
Short text stream clustering has become an important problem for mining textual data in
diverse social media platforms (eg, Twitter). However, most of the existing clustering …
diverse social media platforms (eg, Twitter). However, most of the existing clustering …
Specious sites: Tracking the spread and sway of spurious news stories at scale
Misinformation, propaganda, and outright lies proliferate on the web, with some narratives
having dangerous real-world consequences on public health, elections, and individual …
having dangerous real-world consequences on public health, elections, and individual …
Topic modeling of short texts: A pseudo-document view with word embedding enhancement
Recent years have witnessed the unprecedented growth of online social media, resulting in
short texts being the prevalent format of information on the Internet. Given the sparsity of …
short texts being the prevalent format of information on the Internet. Given the sparsity of …
A nonparametric model for online topic discovery with word embeddings
With the explosive growth of short documents generated from streaming textual sources (eg,
Twitter), latent topic discovery has become a critical task for short text stream clustering …
Twitter), latent topic discovery has become a critical task for short text stream clustering …
Logparse: Making log parsing adaptive through word classification
Logs are one of the most valuable data sources for large-scale service (eg, social network,
search engine) maintenance. Log parsing serves as the the first step towards automated log …
search engine) maintenance. Log parsing serves as the the first step towards automated log …
Benchmarking crisis in social media analytics: a solution for the data-sharing problem
Computational social science uses computational and statistical methods in order to
evaluate social interaction. The public availability of data sets is thus a necessary …
evaluate social interaction. The public availability of data sets is thus a necessary …