Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey

H Jelodar, Y Wang, C Yuan, X Feng, X Jiang… - Multimedia tools and …, 2019 - Springer
Topic modeling is one of the most powerful techniques in text mining for data mining, latent
data discovery, and finding relationships among data and text documents. Researchers …

[책][B] Lifelong machine learning

Z Chen, B Liu - 2018 - books.google.com
Lifelong Machine Learning, Second Edition is an introduction to an advanced machine
learning paradigm that continuously learns by accumulating past knowledge that it then …

Applications of topic models

J Boyd-Graber, Y Hu, D Mimno - Foundations and Trends® in …, 2017 - nowpublishers.com
How can a single person understand what's going on in a collection of millions of
documents? This is an increasingly common problem: sifting through an organization's e …

Distributionally robust language modeling

Y Oren, S Sagawa, TB Hashimoto, P Liang - arxiv preprint arxiv …, 2019 - arxiv.org
Language models are generally trained on data spanning a wide range of topics (eg, news,
reviews, fiction), but they might be applied to an a priori unknown target distribution (eg …

Care and feeding of topic models: Problems, diagnostics, and improvements

J Boyd-Graber, D Mimno… - Handbook of mixed …, 2014 - api.taylorfrancis.com
Topic models are statistical models for learning the latent structure in document collections,
and have gained much attention in the machine learning community over the last decade …

LDA-based topic modeling sentiment analysis using topic/document/sentence (TDS) model

A Farkhod, A Abdusalomov, F Makhmudov, YI Cho - Applied Sciences, 2021 - mdpi.com
Customer reviews on the Internet reflect users' sentiments about the product, service, and
social events. As sentiments can be divided into positive, negative, and neutral forms …

Revisiting multi-domain machine translation

MQ Pham, JM Crego, F Yvon - Transactions of the Association for …, 2021 - direct.mit.edu
When building machine translation systems, one often needs to make the best out of
heterogeneous sets of parallel data in training, and to robustly handle inputs from …

Chunk-based nearest neighbor machine translation

PH Martins, Z Marinho, AFT Martins - arxiv preprint arxiv:2205.12230, 2022 - arxiv.org
Semi-parametric models, which augment generation with retrieval, have led to impressive
results in language modeling and machine translation, due to their ability to retrieve fine …

Cost weighting for neural machine translation domain adaptation

B Chen, C Cherry, G Foster… - Proceedings of the First …, 2017 - aclanthology.org
In this paper, we propose a new domain adaptation technique for neural machine translation
called cost weighting, which is appropriate for adaptation scenarios in which a small in …

Neural machine translation with sentence-level topic context

K Chen, R Wang, M Utiyama… - … /ACM Transactions on …, 2019 - ieeexplore.ieee.org
Traditional neural machine translation (NMT) methods use the word-level context to predict
target language translation while neglecting the sentence-level context, which has been …