Synopses for massive data: Samples, histograms, wavelets, sketches

G Cormode, M Garofalakis, PJ Haas… - … and Trends® in …, 2011 - nowpublishers.com
Abstract Methods for Approximate Query Processing (AQP) are essential for dealing with
massive data. They are often the only means of providing interactive response times when …

Maximum error-bounded piecewise linear representation for online stream approximation

Q **e, C Pang, X Zhou, X Zhang, K Deng - The VLDB journal, 2014 - Springer
Given a time series data stream, the generation of error-bounded Piecewise Linear
Representation (error-bounded PLR) is to construct a number of consecutive line segments …

Constructing comprehensive summaries of large event sequences

J Kiernan, E Terzi - ACM Transactions on Knowledge Discovery from …, 2009 - dl.acm.org
Event sequences capture system and user activity over time. Prior research on sequence
mining has mostly focused on discovering local patterns appearing in a sequence. While …

Tight results for clustering and summarizing data streams

S Guha - Proceedings of the 12th International Conference on …, 2009 - dl.acm.org
In this paper we investigate algorithms and lower bounds for summarization problems over a
single pass data stream. In particular we focus on histogram construction and K-center …

An algorithmic approach to event summarization

P Wang, H Wang, M Liu, W Wang - Proceedings of the 2010 ACM …, 2010 - dl.acm.org
Recently, much study has been directed toward summarizing event data, in the hope that the
summary will lead us to a better understanding of the system that generates the events …

Efficient optimal Kolmogorov approximation of random variables

L Cohen, T Grinshpoun, G Weiss - Artificial Intelligence, 2024 - Elsevier
Discrete random variables are essential ingredients in various artificial intelligence
problems. These include the estimation of the probability of missing the deadline in a series …

Approximating points by a piecewise linear function

DZ Chen, H Wang - Algorithmica, 2013 - Springer
Approximating points by piecewise linear functions is an intensively researched topic in
computational geometry. In this paper, we study, based on the uniform error metric, an array …

Monitoring incremental histogram distribution for change detection in data streams

R Sebastião, J Gama, PP Rodrigues… - … Discovery from Sensor …, 2010 - Springer
Histograms are a common technique for density estimation and they have been widely used
as a tool in exploratory data analysis. Learning histograms from static and stationary data is …

Hierarchical synopses with optimal error guarantees

P Karras, N Mamoulis - ACM Transactions on Database Systems (TODS …, 2008 - dl.acm.org
Hierarchical synopsis structures offer a viable alternative in terms of efficiency and flexibility
in relation to traditional summarization techniques such as histograms. Previous research on …

Computing unrestricted synopses under maximum error bound

C Pang, Q Zhang, X Zhou, D Hansen, S Wang… - Algorithmica, 2013 - Springer
Constructing Haar wavelet synopses with guaranteed maximum error on data
approximations has many real world applications. In this paper, we take a novel approach …