Topic modeling: Perspectives from a literature review

S Robledo, M Zuluaga - IEEE Access, 2022 - ieeexplore.ieee.org
Topic modeling is a Natural Language Processing technique that has gained popularity over
the last ten years, with applications in multiple fields of knowledge. However, there is …

Finding the number of latent topics with semantic non-negative matrix factorization

R Vangara, M Bhattarai, E Skau, G Chennupati… - IEEE …, 2021 - ieeexplore.ieee.org
Topic modeling, or identifying the set of topics that occur in a collection of articles, is one of
the primary objectives of text mining. One of the big challenges in topic modeling is …

Semi-supervised classification of malware families under extreme class imbalance via hierarchical non-negative matrix factorization with automatic model selection

ME Eren, M Bhattarai, RJ Joyce, E Raff… - ACM Transactions on …, 2023 - dl.acm.org
Identification of the family to which a malware specimen belongs is essential in
understanding the behavior of the malware and develo** mitigation strategies. Solutions …

Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization

RC Barron, V Grantcharov, S Wanna, ME Eren… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) are pre-trained on large-scale corpora and excel in
numerous general natural language processing (NLP) tasks, such as question answering …

General-purpose unsupervised cyber anomaly detection via non-negative tensor factorization

ME Eren, JS Moore, E Skau, E Moore… - … Threats: Research and …, 2023 - dl.acm.org
Distinguishing malicious anomalous activities from unusual but benign activities is a
fundamental challenge for cyber defenders. Prior studies have shown that statistical user …

Cyber-Security Knowledge Graph Generation by Hierarchical Nonnegative Matrix Factorization

R Barron, ME Eren, M Bhattarai… - … on Digital Forensics …, 2024 - ieeexplore.ieee.org
Much of human knowledge in cybersecurity is encapsulated within the ever-growing volume
of scientific papers. As this textual data continues to expand, the importance of document …

Senmfk-split: Large corpora topic modeling by semantic non-negative matrix factorization with automatic model selection

ME Eren, N Solovyev, M Bhattarai… - Proceedings of the …, 2022 - dl.acm.org
As the amount of text data continues to grow, topic modeling is serving an important role in
understanding the content hidden by the overwhelming quantity of documents. One popular …

Semantic Non-Negative Matrix Factorization for Term Extraction

A Nugumanova, A Alzhanov, A Mansurova… - Big Data and Cognitive …, 2024 - mdpi.com
This study introduces an unsupervised term extraction approach that combines non-
negative matrix factorization (NMF) with word embeddings. Inspired by a pioneering …

Classifying Malware Using Tensor Decomposition

ME Eren, BS Alexandrov, C Nicholas - Malware: Handbook of Prevention …, 2024 - Springer
Tensor decomposition is a powerful unsupervised machine learning technique capable of
modeling multidimensional data, including that related to malware. This chapter discusses a …

Distributed non-negative rescal with automatic model selection for exascale data

M Bhattarai, I Boureima, E Skau, B Nebgen… - Journal of Parallel and …, 2023 - Elsevier
With the boom in the development of computer hardware and software, social media, IoT
platforms, and communications, there has been exponential growth in the volume of data …