Locally uniform hashing

IO Bercea, L Beretta, J Klausen… - 2023 IEEE 64th …, 2023 - ieeexplore.ieee.org
Hashing is a common technique used in data processing, with a strong impact on the time
and resources spent on computation. Hashing also affects the applicability of theoretical …

Fairhash: A fair and memory/time-efficient hashmap

N Shahbazi, S Sintos, A Asudeh - … of the ACM on Management of Data, 2024 - dl.acm.org
Hashmap is a fundamental data structure in computer science. There has been extensive
research on constructing hashmaps that minimize the number of collisions leading to …

Understanding the moments of tabulation hashing via chaoses

JBT Houen, M Thorup - arxiv preprint arxiv:2205.01453, 2022 - arxiv.org
Simple tabulation hashing dates back to Zobrist in 1970 and is defined as follows: Each key
is viewed as $ c $ characters from some alphabet $\Sigma $, we have $ c $ fully random …

Hashing for Sampling-Based Estimation

A Aamand, IO Bercea, JBT Houen, J Klausen… - arxiv preprint arxiv …, 2024 - arxiv.org
Hash-based sampling and estimation are common themes in computing. Using hashing for
sampling gives us the coordination needed to compare samples from different sets. Hashing …

A Fair and Memory/Time-efficient Hashmap

A Asudeh, N Shahbazi, S Sintos - arxiv preprint arxiv:2307.11355, 2023 - arxiv.org
Hashmap is a fundamental data structure in computer science. There has been extensive
research on constructing hashmaps that minimize the number of collisions leading to …

No repetition: Fast and reliable sampling with highly concentrated hashing

A Aamand, D Das, E Kipouridis, JBT Knudsen… - Proceedings of the …, 2022 - dl.acm.org
Stochastic sample-based estimators are among the most fundamental and universally
applied tools in statistics. Such estimators are particularly important when processing huge …

A sparse Johnson-Lindenstrauss transform using fast hashing

JBT Houen, M Thorup - arxiv preprint arxiv:2305.03110, 2023 - arxiv.org
The\emph {Sparse Johnson-Lindenstrauss Transform} of Kane and Nelson (SODA 2012)
provides a linear dimensionality-reducing map $ A\in\mathbb {R}^{m\times u} $ in $\ell_2 …

No Repetition: Fast Streaming with Highly Concentrated Hashing

A Aamand, D Das, E Kipouridis, JBT Knudsen… - arxiv preprint arxiv …, 2020 - arxiv.org
To get estimators that work within a certain error bound with high probability, a common
strategy is to design one that works with constant probability, and then boost the probability …