[BOEK][B] Data clustering: theory, algorithms, and applications

G Gan, C Ma, J Wu - 2020 - SIAM
The monograph Data Clustering: Theory, Algorithms, and Applications was published in
2007. Starting with the common ground and knowledge for data clustering, the monograph …

Synopses for massive data: Samples, histograms, wavelets, sketches

G Cormode, M Garofalakis, PJ Haas… - … and Trends® in …, 2011 - nowpublishers.com
Abstract Methods for Approximate Query Processing (AQP) are essential for dealing with
massive data. They are often the only means of providing interactive response times when …

An improved data stream summary: the count-min sketch and its applications

G Cormode, S Muthukrishnan - Journal of Algorithms, 2005 - Elsevier
We introduce a new sublinear space data structure—the count-min sketch—for summarizing
data streams. Our sketch allows fundamental queries in data stream summarization such as …

Data streams: Algorithms and applications

S Muthukrishnan - Foundations and Trends® in Theoretical …, 2005 - nowpublishers.com
In the data stream scenario, input arrives very rapidly and there is limited memory to store
the input. Algorithms have to work with one or few passes over the data, space less than …

Methods for mining frequent items in data streams: an overview

H Liu, Y Lin, J Han - Knowledge and information systems, 2011 - Springer
In many real-world applications, information such as web click data, stock ticker data, sensor
network data, phone call records, and traffic monitoring data appear in the form of data …

Compressed sensing and best 𝑘-term approximation

A Cohen, W Dahmen, R DeVore - Journal of the American mathematical …, 2009 - ams.org
Compressed sensing is a new concept in signal processing where one seeks to minimize
the number of measurements to be taken from signals while still retaining the information …

Clustering data streams: Theory and practice

S Guha, A Meyerson, N Mishra… - IEEE transactions on …, 2003 - ieeexplore.ieee.org
The data stream model has recently attracted attention for its applicability to numerous types
of data, including telephone records, Web documents, and clickstreams. For analysis of such …

The history of histograms (abridged)

Y Ioannidis - Proceedings 2003 VLDB Conference, 2003 - Elsevier
Publisher Summary The history of histograms is long and rich, full of detailed information in
every step. It includes the course of histograms in different scientific fields, the successes …

What's hot and what's not: tracking most frequent items dynamically

G Cormode, S Muthukrishnan - ACM Transactions on Database Systems …, 2005 - dl.acm.org
Most database management systems maintain statistics on the underlying relation. One of
the important statistics is that of the “hot items” in the relation: those that appear many times …

Finding frequent items in data streams

G Cormode, M Hadjieleftheriou - Proceedings of the VLDB Endowment, 2008 - dl.acm.org
The frequent items problem is to process a stream of items and find all items occurring more
than a given fraction of the time. It is one of the most heavily studied problems in data stream …