Recent Approaches and Trends in Approximate Nearest Neighbor Search, with Remarks on Benchmarking.

M Aumüller, M Ceccarello - IEEE Data Eng. Bull., 2023 - sites.computer.org
Nearest neighbor search is a computational primitive whose efficiency is paramount to many
applications. As such, the literature recently blossomed with many works focusing on …

[HTML][HTML] The role of local dimensionality measures in benchmarking nearest neighbor search

M Aumüller, M Ceccarello - Information Systems, 2021 - Elsevier
This paper reconsiders common benchmarking approaches to nearest neighbor search. It is
shown that the concepts of local intrinsic dimensionality (LID), local relative contrast (RC) …

ParlayANN: Scalable and Deterministic Parallel Graph-Based Approximate Nearest Neighbor Search Algorithms

MD Manohar, Z Shen, G Blelloch, L Dhulipala… - Proceedings of the 29th …, 2024 - dl.acm.org
Approximate nearest-neighbor search (ANNS) algorithms are a key part of the modern deep
learning stack due to enabling efficient similarity search over high-dimensional vector space …

Fast and Scalable Mining of Time Series Motifs with Probabilistic Guarantees

M Ceccarello, J Gamper - Proceedings of the VLDB Endowment, 2022 - dl.acm.org
Mining time series motifs is a fundamental, yet expensive task in exploratory data analytics.
In this paper, we therefore propose a fast method to find the top-k motifs with probabilistic …

Multi-class network traffic generators and classifiers based on neural networks

RF Bikmukhamedov, AF Nadeev - … Processing in the Field of on …, 2021 - ieeexplore.ieee.org
We introduce a neural network framework that allows constructing multi-class network traffic
models suitable for flow generation and classification tasks. Packet size and inter-packet …

Deann: Speeding up kernel-density estimation using approximate nearest neighbor search

M Karppa, M Aumüller, R Pagh - … Conference on Artificial …, 2022 - proceedings.mlr.press
Abstract Kernel Density Estimation (KDE) is a nonparametric method for estimatig the shape
of a density function, given a set of samples from the distribution. Recently, locality-sensitive …

A Survey on Efficient Processing of Similarity Queries over Neural Embeddings

Y Wang - arxiv preprint arxiv:2204.07922, 2022 - arxiv.org
Similarity query is the family of queries based on some similarity metrics. Unlike the
traditional database queries which are mostly based on value equality, similarity queries aim …

A new fast inverted file-based algorithm for approximate nearest neighbor search without accuracy reduction

Y Liu, Z Pan, L Wang, Y Wang - Information Sciences, 2022 - Elsevier
The inverted file (IVF) is one of the most efficient non-exhaustive search algorithms for
approximate nearest neighbor (ANN) search. In this paper, we propose a new three-step fast …

Solving k-Closest Pairs in High-Dimensional Data

M Aumüller, M Ceccarello - International Conference on Similarity Search …, 2023 - Springer
We investigate the k-closest pair problem in high dimensions, that is finding the k≥ 1 closest
pairs of points in a set S⊆ X in a metric space (X, dist). This is a fundamental problem in …

Enhancing Approximate Nearest Neighbor Search: Binary-Indexed LSH-Tries, Trie Rebuilding, and Batch Extraction

CJW Romild, TH Schauser, JA Borup - International Conference on …, 2023 - Springer
Abstract Locality-Sensitive-Hashing (LSH) plays a crucial role in approximate nearest
neighbour search and similarity-based queries. In this paper, we present a study on the …