Anchored correlation explanation: Topic modeling with minimal domain knowledge
While generative models such as Latent Dirichlet Allocation (LDA) have proven fruitful in
topic modeling, they often require detailed assumptions and careful specification of …
topic modeling, they often require detailed assumptions and careful specification of …
Navigating the local modes of big data
Each day humans generate massive volumes of data in a variety of different forms (Lazer et
al., 2009). For example, digitized texts provide a rich source of political content through …
al., 2009). For example, digitized texts provide a rich source of political content through …
Multilingual anchoring: Interactive topic modeling and alignment across languages
Multilingual topic models can reveal patterns in cross-lingual document collections.
However, existing models lack speed and interactivity, which prevents adoption in everyday …
However, existing models lack speed and interactivity, which prevents adoption in everyday …
Tandem anchoring: A multiword anchor approach for interactive topic modeling
Interactive topic models are powerful tools for those seeking to understand large collections
of text. However, existing sampling-based interactive topic modeling approaches scale …
of text. However, existing sampling-based interactive topic modeling approaches scale …
Learning topic models--provably and efficiently
Today, we have both the blessing and the curse of being overloaded with information. Never
before has text been more important to how we communicate, or more easily available. But …
before has text been more important to how we communicate, or more easily available. But …
[PDF][PDF] Is your anchor going up or down? Fast and accurate supervised topic models
Topic models provide insights into document collections, and their supervised extensions
also capture associated document-level metadata such as sentiment. However, inferring …
also capture associated document-level metadata such as sentiment. However, inferring …
Toward interpretable topic discovery via anchored correlation explanation
Many predictive tasks, such as diagnosing a patient based on their medical chart, are
ultimately defined by the decisions of human experts. Unfortunately, encoding experts' …
ultimately defined by the decisions of human experts. Unfortunately, encoding experts' …
Learning latent topics from the word co-occurrence network
Topic modeling is widely used to uncover the latent thematic structure in corpora. Based on
the separability assumption, the spectral method focuses on the word co-occurrence …
the separability assumption, the spectral method focuses on the word co-occurrence …
Robust spectral inference for joint stochastic matrix factorization
Spectral inference provides fast algorithms and provable optimality for latent topic analysis.
But for real data these algorithms require additional ad-hoc heuristics, and even then often …
But for real data these algorithms require additional ad-hoc heuristics, and even then often …
Unsupervised Hierarchical Topic Modeling via Anchor Word Clustering and Path Guidance
Hierarchical topic models nowadays tend to capture the relationship between words and
topics, often ignoring the role of anchor words that guide text generation. For the first time …
topics, often ignoring the role of anchor words that guide text generation. For the first time …