Topic modeling using latent Dirichlet allocation: A survey

U Chauhan, A Shah - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
We are not able to deal with a mammoth text corpus without summarizing them into a
relatively small subset. A computational tool is extremely needed to understand such a …

[PDF][PDF] Gaussian LDA for topic models with word embeddings

R Das, M Zaheer, C Dyer - … of the 53rd Annual Meeting of the …, 2015 - aclanthology.org
Continuous space word embeddings learned from large, unstructured corpora have been
shown to be effective at capturing semantic regularities in language. In this paper we …

Improving topic models with latent feature word representations

DQ Nguyen, R Billingsley, L Du… - Transactions of the …, 2015 - direct.mit.edu
Probabilistic topic models are widely used to discover latent topics in document collections,
while latent feature vector representations of words have been used to obtain high …

Retaining data from streams of social platforms with minimal regret

TT Nguyen, CT Duong, M Weidlich, H Yin… - … Joint Conference on …, 2017 - infoscience.epfl.ch
Today's social platforms, such as Twitter and Facebook, continuously generate massive
volumes of data. The resulting data streams exceed any reasonable limit for permanent …

RollingLDA: An update algorithm of Latent Dirichlet Allocation to construct consistent time series from textual data

J Rieger, C Jentsch, J Rahnenführer - Findings of the Association …, 2021 - aclanthology.org
We propose a rolling version of the Latent Dirichlet Allocation, called RollingLDA. By a
sequential approach, it enables the construction of LDA-based time series of topics that are …

Online learning from trapezoidal data streams

Q Zhang, P Zhang, G Long, W Ding… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
In this paper, we study a new problem of continuous learning from doubly-streaming data
where both data volume and feature space increase over time. We refer to the doubly …

Feature incremental learning with causality

H Ni, S Gu, R Fan, C Hou - Pattern Recognition, 2024 - Elsevier
With the emerging of new data collection ways, the features are incremental and
accumulated gradually. Due to the expansion of feature spaces, it is more common that …

[КНИГА][B] Bayesian analysis in natural language processing

S Cohen - 2022 - books.google.com
Natural language processing (NLP) went through a profound transformation in the mid-
1980s when it shifted to make heavy use of corpora and data-driven techniques to analyze …

Combining LSTM and latent topic modeling for mortality prediction

Y Jo, L Lee, S Palaskar - arxiv preprint arxiv:1709.02842, 2017 - arxiv.org
There is a great need for technologies that can predict the mortality of patients in intensive
care units with both high accuracy and accountability. We present joint end-to-end neural …

Online passive-aggressive active learning for trapezoidal data streams

Y Liu, X Fan, W Li, Y Gao - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org
The idea of combining the active query strategy and the passive-aggressive (PA) update
strategy in online learning can be credited to the PA active (PAA) algorithm, which has …