An integrated clustering and BERT framework for improved topic modeling

L George, P Sumathy - International Journal of Information Technology, 2023 - Springer
Topic modelling is a machine learning technique that is extensively used in Natural
Language Processing (NLP) applications to infer topics within unstructured textual data …

[PDF][PDF] Nonnegative matrix factorization for signal and data analytics: Identifiability, algorithms, and applications.

X Fu, K Huang, ND Sidiropoulos… - IEEE Signal Process …, 2019 - ieeexplore.ieee.org
X≈ WH, W∈ RM× R, H∈ RN× R,(1) to 'explain'the data matrix X, where W≥ 0, H≥ 0, and
R≤ min {M, N}. At first glance, NMF is nothing but an alternative factorization model to …

[KNJIGA][B] Nonnegative matrix factorization

N Gillis - 2020 - SIAM
Identifying the underlying structure of a data set and extracting meaningful information is a
key problem in data analysis. Simple and powerful methods to achieve this goal are linear …

Aggregated topic models for increasing social media topic coherence

SJ Blair, Y Bi, MD Mulvenna - Applied intelligence, 2020 - Springer
This research presents a novel aggregating method for constructing an aggregated topic
model that is composed of the topics with greater coherence than individual models. When …

[PDF][PDF] Unsupervised content-based identification of fake news articles with tensor decomposition ensembles

S Hosseinimotlagh, EE Papalexakis - Proceedings of the Workshop on …, 2018 - cs.ucr.edu
Social media provide a platform for quick and seamless access to information. However, the
propagation of false information, especially during the last year, raises major concerns …

On identifiability of nonnegative matrix factorization

X Fu, K Huang, ND Sidiropoulos - IEEE Signal Processing …, 2018 - ieeexplore.ieee.org
In this letter, we propose a new identification criterion that guarantees the recovery of the low-
rank latent factors in the nonnegative matrix factorization (NMF) generative model, under …

Blind audio source separation with minimum-volume beta-divergence NMF

V Leplat, N Gillis, AMS Ang - IEEE Transactions on Signal …, 2020 - ieeexplore.ieee.org
Considering a mixed signal composed of various audio sources and recorded with a single
microphone, we consider in this paper the blind audio source separation problem which …

Deep NMF topic modeling

J Wang, XL Zhang - Neurocomputing, 2023 - Elsevier
Nonnegative matrix factorization (NMF) based topic modeling methods do not rely on model-
or data-assumptions much. However, they are usually formulated as difficult optimization …

Semisoft clustering of single-cell data

L Zhu, J Lei, L Klei, B Devlin, K Roeder - Proceedings of the National …, 2019 - pnas.org
Motivated by the dynamics of development, in which cells of recognizable types, or pure cell
types, transition into other types over time, we propose a method of semisoft clustering that …

Population sequencing data reveal a compendium of mutational processes in the human germ line

VB Seplyarskiy, RA Soldatov, E Koch, RJ McGinty… - Science, 2021 - science.org
Biological mechanisms underlying human germline mutations remain largely unknown. We
statistically decompose variation in the rate and spectra of mutations along the genome …