Source coding, large deviations, and approximate pattern matching

A Dembo, L Kontoyiannis - IEEE Transactions on Information …, 2002 - ieeexplore.ieee.org
We present a development of parts of rate-distortion theory and pattern-matching algorithms
for lossy data compression, centered around a lossy version of the asymptotic equipartition …

[PDF][PDF] Fifty years of Shannon theory

S Verdu - IEEE Transactions on information theory, 1998 - academia.edu
Fifty Years of Shannon Theory Page 1 IEEE TRANSACTIONS ON INFORMATION THEORY,
VOL. 44, NO. 6, OCTOBER 1998 2057 Fifty Years of Shannon Theory Sergio Verdú, Fellow …

[KSIĄŻKA][B] Average case analysis of algorithms on sequences

W Szpankowski - 2011 - books.google.com
A timely book on a topic that has witnessed a surge of interest over the last decade, owing in
part to several novel applications, most notably in data compression and computational …

[KSIĄŻKA][B] The ergodic theory of discrete sample paths

PC Shields - 1996 - books.google.com
This book is about finite-alphabet stationary processes, which are important in physics,
engineering, and data compression. The focus is on the combinatorial properties of typical …

Nonparametric entropy estimation for stationary processes and random fields, with applications to English text

I Kontoyiannis, PH Algoet, YM Suhov… - IEEE Transactions on …, 1998 - ieeexplore.ieee.org
We discuss a family of estimators for the entropy rate of a stationary ergodic process and
prove their pointwise and mean consistency under a Doeblin-type mixing condition. The …

The context-tree weighting method: Extensions

FMJ Willems - IEEE Transactions on Information Theory, 1998 - ieeexplore.ieee.org
First we modify the basic (binary) context-tree weighting method such that the past symbols
x/sub 1-D/, x/sub 2-D/,..., x/sub 0/are not needed by the encoder and the decoder. Then we …

Estimating the entropy of binary time series: Methodology, some theory and a simulation study

Y Gao, I Kontoyiannis, E Bienenstock - Entropy, 2008 - mdpi.com
Partly motivated by entropy-estimation problems in neuroscience, we present a detailed and
extensive comparison between some of the most popular and effective entropy estimation …

Compression of low entropy strings with Lempel--Ziv algorithms

SR Kosaraju, G Manzini - SIAM Journal on Computing, 2000 - SIAM
We compare the compression ratio of the Lempel--Ziv algorithms with the empirical entropy
of the input string. This approach makes it possible to analyze the performance of these …

Asymptotic behavior of the Lempel-Ziv parsing scheme and digital search trees

P Jacquet, W Szpankowski - Theoretical Computer Science, 1995 - Elsevier
The Lempel-Ziv parsing scheme finds a wide range of applications, most notably in data
compression and algorithms on words. It partitions a sequence of length n into variable …

[PDF][PDF] A database index to large biological sequences

E Hunt, MP Atkinson, RW Irving - work, 2001 - academia.edu
We present an approach to searching genetic DNA sequences using an adaptation of the
sufx tree data structure deployed on the general purpose persistent Java platform, PJama …