Near-Optimal -Clustering in the Sliding Window Model

D Woodruff, P Zhong, S Zhou - Advances in Neural …, 2024 - proceedings.neurips.cc
Clustering is an important technique for identifying structural information in large-scale data
analysis, where the underlying dataset may be too large to store. In many applications …

Online prediction in sub-linear space

B Peng, F Zhang - Proceedings of the 2023 Annual ACM-SIAM …, 2023 - SIAM
We provide the first sub-linear space and sub-linear regret algorithm for online learning with
expert advice (against an oblivious adversary), addressing an open question raised recently …

Streaming algorithms for learning with experts: Deterministic versus robust

DP Woodruff, F Zhang, S Zhou - arxiv preprint arxiv:2303.01709, 2023 - arxiv.org
In the online learning with experts problem, an algorithm must make a prediction about an
outcome on each of $ T $ days (or times), given a set of $ n $ experts who make predictions …

A Strong Separation for Adversarially Robust ℓ0 Estimation for Linear Sketches

E Gribelyuk, H Lin, DP Woodruff, H Yu… - 2024 IEEE 65th …, 2024 - ieeexplore.ieee.org
The majority of streaming problems are defined and analyzed in a static setting, where the
data stream is any worst-case sequence of insertions and deletions which is fixed in …

On robust streaming for learning with experts: algorithms and lower bounds

D Woodruff, F Zhang, S Zhou - Advances in Neural …, 2023 - proceedings.neurips.cc
In the online learning with experts problem, an algorithm makes predictions about an
outcome on each of $ T $ days, given a set of $ n $ experts who make predictions on each …

Improved algorithms for white-box adversarial streams

Y Feng, D Woodruff - International Conference on Machine …, 2023 - proceedings.mlr.press
We study streaming algorithms in the white-box adversarial stream model, where the
internal state of the streaming algorithm is revealed to an adversary who adaptively …

Fully dynamic shortest path reporting against an adaptive adversary

A Alokhina, J van den Brand - Proceedings of the 2024 Annual ACM-SIAM …, 2024 - SIAM
Algebraic data structures are the main subroutine for maintaining distances in fully dynamic
graphs in subquadratic time. However, these dynamic algebraic algorithms generally cannot …

Differentially Private -Heavy Hitters in the Sliding Window Model

J Blocki, S Lee, T Mukherjee, S Zhou - arxiv preprint arxiv:2302.11081, 2023 - arxiv.org
The data management of large companies often prioritize more recent data, as a source of
higher accuracy prediction than outdated data. For example, the Facebook data policy …

Streaming algorithms for the missing item finding problem

M Stoeckl - Proceedings of the 2023 Annual ACM-SIAM …, 2023 - SIAM
Many problems on data streams have been studied at two extremes of difficulty: either
allowing randomized algorithms, in the static setting (where they should err with bounded …

[PDF][PDF] A New Information Complexity Measure for Multi-pass Streaming with Applications

M Braverman, S Garg, Q Li, S Wang… - Proceedings of the 56th …, 2024 - dl.acm.org
We introduce a new notion of information complexity for multi-pass streaming problems and
use it to resolve several important questions in data streams. In the coin problem, one sees a …