BERTopic: Neural topic modeling with a class-based TF-IDF procedure

M Grootendorst - arxiv preprint arxiv:2203.05794, 2022 - arxiv.org
Topic models can be useful tools to discover latent topics in collections of documents.
Recent studies have shown the feasibility of approach topic modeling as a clustering task …

[BOOK][B] Data clustering: theory, algorithms, and applications

G Gan, C Ma, J Wu - 2020 - SIAM
The monograph Data Clustering: Theory, Algorithms, and Applications was published in
2007. Starting with the common ground and knowledge for data clustering, the monograph …

Modeling of spatial stratified heterogeneity

J Guo, J Wang, C Xu, Y Song - GIScience & Remote Sensing, 2022 - Taylor & Francis
Spatial stratified heterogeneity (SSH) refers to the geographical phenomena in which the
geographical attributes within-strata are more similar than the between-strata, which is …

Data stream clustering techniques, applications, and models: comparative analysis and discussion

U Kokate, A Deshpande, P Mahalle, P Patil - Big Data and Cognitive …, 2018 - mdpi.com
Data growth in today's world is exponential, many applications generate huge amount of
data streams at very high speed such as smart grids, sensor networks, video surveillance …

Enhanced fuzzy clustering for incomplete instance with evidence combination

Z Liu, S Letchmunan - ACM Transactions on Knowledge Discovery from …, 2024 - dl.acm.org
Clustering incomplete instance is still a challenging task since missing values maybe make
the cluster information ambiguous, leading to the uncertainty and imprecision in results. This …

{VBASE}: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity

Q Zhang, S Xu, Q Chen, G Sui, J **e, Z Cai… - … USENIX Symposium on …, 2023 - usenix.org
Approximate similarity queries on high-dimensional vector indices have become the
cornerstone for many critical online services. An increasing need for more sophisticated …

Do you hear the people sing? key point analysis via iterative clustering and abstractive summarisation

H Li, V Schlegel, R Batista-Navarro… - arxiv preprint arxiv …, 2023 - arxiv.org
Argument summarisation is a promising but currently under-explored field. Recent work has
aimed to provide textual summaries in the form of concise and salient short texts, ie, key …

[HTML][HTML] Zoning of reservoir water temperature field based on K-means clustering algorithm

W Liu, P Zou, D Jiang, X Quan, H Dai - Journal of Hydrology: Regional …, 2022 - Elsevier
Abstract Study region Dongqing Reservoir located in Guizhou, China. Study focus Zoning
the RWTF (reservoir water temperature field) is of great significance and is an effective way …

[HTML][HTML] Remaining discharge energy estimation for lithium-ion batteries using pattern recognition and power prediction

O Hatherall, MF Niri, A Barai, Y Li, J Marco - Journal of Energy Storage, 2023 - Elsevier
The remaining discharge energy (RDE) of a battery is an important value for estimating the
remaining range of a vehicle. Prediction based methods for calculating RDE have been …

Co-clustering: A Survey of the Main Methods, Recent Trends, and Open Problems

E Battaglia, F Peiretti, RG Pensa - ACM Computing Surveys, 2024 - dl.acm.org
Since its early formulations, co-clustering has gained popularity and interest both within and
outside the machine learning community as a powerful learning paradigm for clustering high …