Big data stream analysis: a systematic literature review

T Kolajo, O Daramola, A Adebiyi - Journal of Big Data, 2019 - Springer
Recently, big data streams have become ubiquitous due to the fact that a number of
applications generate a huge amount of data at a great velocity. This made it difficult for …

Tight bounds for adversarially robust streams and sliding windows via difference estimators

DP Woodruff, S Zhou - 2021 IEEE 62nd Annual Symposium on …, 2022 - ieeexplore.ieee.org
In the adversarially robust streaming model, a stream of elements is presented to an
algorithm and is allowed to depend on the output of the algorithm at earlier times during the …

Near-Optimal -Clustering in the Sliding Window Model

D Woodruff, P Zhong, S Zhou - Advances in Neural …, 2024 - proceedings.neurips.cc
Clustering is an important technique for identifying structural information in large-scale data
analysis, where the underlying dataset may be too large to store. In many applications …

Near optimal linear algebra in the online and sliding window models

V Braverman, P Drineas, C Musco… - 2020 IEEE 61st …, 2020 - ieeexplore.ieee.org
We initiate the study of numerical linear algebra in the sliding window model, where only the
most recent W updates in a stream form the underlying data set. Although many existing …

How to make your approximation algorithm private: A black-box differentially-private transformation for tunable approximation algorithms of functions with low …

J Blocki, E Grigorescu, T Mukherjee, S Zhou - arxiv preprint arxiv …, 2022 - arxiv.org
We develop a framework for efficiently transforming certain approximation algorithms into
differentially-private variants, in a black-box manner. Specifically, our results focus on …

The predicted-deletion dynamic model: Taking advantage of ml predictions, for free

QC Liu, V Srinivas - arxiv preprint arxiv:2307.08890, 2023 - arxiv.org
The main bottleneck in designing efficient dynamic algorithms is the unknown nature of the
update sequence. In particular, there are some problems, like 3-vertex connectivity, planar …

Truly perfect samplers for data streams and sliding windows

R Jayaram, DP Woodruff, S Zhou - … of the 41st ACM SIGMOD-SIGACT …, 2022 - dl.acm.org
In the G-sampling problem, the goal is to output an index i of a vector f∈ Rn, such that for all
coordinates j∈[n],[Pr [i= j]=(1±ε)(G (fj))/(∑ k∈[n] G (fk))+ γ,] where G: R→ R≥ 0 is some non …

The predicted-updates dynamic model: Offline, incremental, and decremental to fully dynamic transformations

QC Liu, V Srinivas - The Thirty Seventh Annual Conference …, 2024 - proceedings.mlr.press
The main bottleneck in designing efficient dynamic algorithms is the unknown nature of the
update sequence. In particular, there are problems where the separation in runtime between …

Matrix sketching over sliding windows

Z Wei, X Liu, F Li, S Shang, X Du, JR Wen - Proceedings of the 2016 …, 2016 - dl.acm.org
Large-scale matrix computation becomes essential for many data data applications, and
hence the problem of sketching matrix with small space and high precision has received …

Stream frequency over interval queries

RB Basat, R Friedman, R Shahout - Proceedings of the VLDB …, 2018 - dl.acm.org
Stream frequency measurements are fundamental in many data stream applications such as
financial data trackers, intrusion-detection systems, and network monitoring. Typically …