Challenges in benchmarking stream learning algorithms with real-world data

VMA Souza, DM dos Reis, AG Maletzke… - Data Mining and …, 2020 - Springer
Streaming data are increasingly present in real-world applications such as sensor
measurements, satellite data feed, stock market, and financial data. The main characteristics …

Federated learning on non-iid data silos: An experimental study

Q Li, Y Diao, Q Chen, B He - 2022 IEEE 38th international …, 2022 - ieeexplore.ieee.org
Due to the increasing privacy concerns and data regulations, training data have been
increasingly fragmented, forming distributed databases of multiple “data silos”(eg, within …

CoSaMP: Iterative signal recovery from incomplete and inaccurate samples

D Needell, JA Tropp - Applied and computational harmonic analysis, 2009 - Elsevier
Compressive sampling offers a new paradigm for acquiring signals that are compressible
with respect to an orthonormal basis. The major algorithmic challenge in compressive …

Synopses for massive data: Samples, histograms, wavelets, sketches

G Cormode, M Garofalakis, PJ Haas… - … and Trends® in …, 2011 - nowpublishers.com
Abstract Methods for Approximate Query Processing (AQP) are essential for dealing with
massive data. They are often the only means of providing interactive response times when …

Similarity estimation techniques from rounding algorithms

MS Charikar - Proceedings of the thiry-fourth annual ACM symposium …, 2002 - dl.acm.org
(MATH) A locality sensitive hashing scheme is a distribution on a family \F of hash functions
operating on a collection of objects, such that for two objects x, y, Pr h εF h (x)= h (y)= sim (x …

An improved data stream summary: the count-min sketch and its applications

G Cormode, S Muthukrishnan - Journal of Algorithms, 2005 - Elsevier
We introduce a new sublinear space data structure—the count-min sketch—for summarizing
data streams. Our sketch allows fundamental queries in data stream summarization such as …

Models and issues in data stream systems

B Babcock, S Babu, M Datar, R Motwani… - Proceedings of the twenty …, 2002 - dl.acm.org
In this overview paper we motivate the need for and research issues arising from a new
model of data processing. In this model, data does not take the form of persistent relations …

Finding frequent items in data streams

M Charikar, K Chen, M Farach-Colton - International Colloquium on …, 2002 - Springer
We present a 1-pass algorithm for estimating the most frequent items in a data stream using
very limited storage space. Our method relies on a novel data structure called a count …

Data streams: Algorithms and applications

S Muthukrishnan - Foundations and Trends® in Theoretical …, 2005 - nowpublishers.com
In the data stream scenario, input arrives very rapidly and there is limited memory to store
the input. Algorithms have to work with one or few passes over the data, space less than …

Compressed sensing and best 𝑘-term approximation

A Cohen, W Dahmen, R DeVore - Journal of the American mathematical …, 2009 - ams.org
Compressed sensing is a new concept in signal processing where one seeks to minimize
the number of measurements to be taken from signals while still retaining the information …