Enhancing K-nearest neighbor algorithm: a comprehensive review and performance analysis of modifications

RK Halder, MN Uddin, MA Uddin, S Aryal, A Khraisat - Journal of Big Data, 2024 - Springer
Abstract The k-Nearest Neighbors (kNN) method, established in 1951, has since evolved
into a pivotal tool in data mining, recommendation systems, and Internet of Things (IoT) …

[HTML][HTML] Machine learning for Internet of Things data analysis: A survey

MS Mahdavinejad, M Rezvan, M Barekatain… - Digital Communications …, 2018 - Elsevier
Rapid developments in hardware, software, and communication technologies have
facilitated the emergence of Internet-connected sensory devices that provide observations …

Efficient kNN classification algorithm for big data

Z Deng, X Zhu, D Cheng, M Zong, S Zhang - Neurocomputing, 2016 - Elsevier
K nearest neighbors (kNN) is an efficient lazy learning algorithm and has successfully been
developed in real applications. It is natural to scale the kNN method to the large scale …

Fast approximate nearest neighbor search with the navigating spreading-out graph

C Fu, C **ang, C Wang, D Cai - arxiv preprint arxiv:1707.00143, 2017 - arxiv.org
Approximate nearest neighbor search (ANNS) is a fundamental problem in databases and
data mining. A scalable ANNS algorithm should be both memory-efficient and fast. Some …

Progressive distributed and parallel similarity retrieval of large CT image sequences in mobile telemedicine networks

Y Zhuang, N Jiang, Y Xu - Wireless communications and …, 2022 - Wiley Online Library
Computed tomography image (CTI) sequence is essentially a time‐series data that typically
consists of a large amount of nearby and similar CTIs. Due to the high communication and …

Survey on exact knn queries over high-dimensional data space

N Ukey, Z Yang, B Li, G Zhang, Y Hu, W Zhang - Sensors, 2023 - mdpi.com
k nearest neighbours (kNN) queries are fundamental in many applications, ranging from
data mining, recommendation system and Internet of Things, to Industry 4.0 framework …

Tsunami: A learned multi-dimensional index for correlated data and skewed workloads

J Ding, V Nathan, M Alizadeh, T Kraska - arxiv preprint arxiv:2006.13282, 2020 - arxiv.org
Filtering data based on predicates is one of the most fundamental operations for any modern
data warehouse. Techniques to accelerate the execution of filter expressions include …

Inter-media hashing for large-scale retrieval from heterogeneous data sources

J Song, Y Yang, Y Yang, Z Huang… - Proceedings of the 2013 …, 2013 - dl.acm.org
In this paper, we present a new multimedia retrieval paradigm to innovate large-scale
search of heterogenous multimedia data. It is able to return results of different media types …

BIRCH: an efficient data clustering method for very large databases

T Zhang, R Ramakrishnan, M Livny - ACM sigmod record, 1996 - dl.acm.org
Finding useful patterns in large datasets has attracted considerable interest recently, and
one of the most widely studied problems in this area is the identification of clusters, or …

3d-aware object goal navigation via simultaneous exploration and identification

J Zhang, L Dai, F Meng, Q Fan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Object goal navigation (ObjectNav) in unseen environments is a fundamental task for
Embodied AI. Agents in existing works learn ObjectNav policies based on 2D maps, scene …