Cache-oblivious high-performance similarity join
A similarity join combines vectors based on a distance condition. Typically, such algorithms
apply a filter step (by indexing or sorting) and then refine pairs of candidate vectors. In this …
Approximate similarity search under edit distance using locality-sensitive hashing
S McCauley - arXiv preprint arXiv:1907.01600, 2019 - arxiv.org
Edit distance similarity search, also called approximate pattern matching, is a fundamental
problem with widespread database applications. The goal of the problem is to preprocess …
Similarity search with tensor core units
Abstract Tensor Core Units (TCUs) are hardware accelerators developed for deep neural
networks, which efficiently support the multiplication of two dense m * m matrices, where m is …
Implementing distributed similarity joins using locality sensitive hashing
Similarity joins are a basic primitive in data mining. Given two sets of points, we are
interested in reporting all pairs of points whose similarity is above a user-defined threshold …
Accelerating array joining with integrated value-index
Large-scale multidimensional array processing is becoming an increasingly important
problem with the rise of big data, scientific data processing, and machine learning …
LayerLSH: Rebuilding Locality-Sensitive Hashing Indices by Exploring Density of Hash Values
Locality-sensitive hashing (LSH) has attracted extensive research efforts for approximate
nearest neighbors (NN) search. However, most of these LSH-based index structures fail to …
[BOOK][B] Advanced data structures: theory and applications
Advanced data structures is a core course in Computer Science which most graduate
programs in Computer Science, Computer Science and Engineering, and other allied …
Mathematical Models for Local Sensing Hashes
L Wang, L Wangner - arXiv preprint arXiv:2111.08344, 2021 - arxiv.org
As data volumes continue to grow, searches in data are becoming increasingly time-
consuming. Classical index structures for neighbor search are no longer sustainable due to …
[PDF][PDF] Implementing Distributed Approximate Similarity Joins using Locality Sensitive Hashing
Similarity joins are a basic primitive in data mining. Given two sets of points, we are
interested in reporting all pairs of points whose similarity is above a user-defined threshold …