- Academic Search

G Navarro - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

Two decades ago, a breakthrough in indexing string collections made it possible to
represent them within their compressed space while at the same time offering indexed …

Save Cite Cited by 119 Related articles All 7 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Fully functional suffix trees and optimal text searching in BWT-runs bounded space

T Gagie, G Navarro, N Prezza - Journal of the ACM (JACM), 2020 - dl.acm.org

Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …

Save Cite Cited by 206 Related articles All 12 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

At the roots of dictionary compression: string attractors

D Kempa, N Prezza - Proceedings of the 50th Annual ACM SIGACT …, 2018 - dl.acm.org

A well-known fact in the field of lossless text compression is that high-order entropy is a
weak model when the input contains long repetitions. Motivated by this fact, decades of …

Save Cite Cited by 156 Related articles All 17 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Collapsing the hierarchy of compressed data structures: Suffix arrays in optimal compressed space

D Kempa, T Kociumaka - 2023 IEEE 64th Annual Symposium …, 2023 - ieeexplore.ieee.org

The last two decades have witnessed a dramatic increase in the amount of highly repetitive
datasets consisting of sequential data (strings, texts). Processing these massive amounts of …

Save Cite Cited by 21 Related articles All 6 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] acm.org Full View

Resolution of the burrows-wheeler transform conjecture

D Kempa, T Kociumaka - Communications of the ACM, 2022 - dl.acm.org

Abstract The Burrows-Wheeler Transform (BWT) is an invertible text transformation that
permutes symbols of a text according to the lexicographical order of its suffixes. BWT is the …

Save Cite Cited by 99 Related articles All 10 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] helsinki.fi

[BOOK][B] Genome-scale algorithm design

V Mäkinen, D Belazzougui, F Cunial, AI Tomescu - 2015 - books.google.com

High-throughput sequencing has revolutionised the field of biological sequence analysis. Its
application has enabled researchers to address important biological questions, often for the …

Save Cite Cited by 183 Related articles All 11 versions Free GPT-4 DeepSeek Library Search

[Free GPT-4]
[DeepSeek]

[PDF] siam.org

Optimal-time text indexing in BWT-runs bounded space

T Gagie, G Navarro, N Prezza - Proceedings of the Twenty-Ninth Annual ACM …, 2018 - SIAM

Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …

Save Cite Cited by 135 Related articles All 13 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] unive.it

Towards a definitive measure of repetitiveness

T Kociumaka, G Navarro, N Prezza - Latin American Symposium on …, 2020 - Springer

Unlike in statistical compression, where Shannon's entropy is a definitive lower bound, no
such clear measure exists for the compressibility of repetitive sequences. Since statistical …

Save Cite Cited by 62 Related articles All 5 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] sciencedirect.com

On compressing and indexing repetitive sequences

S Kreft, G Navarro - Theoretical Computer Science, 2013 - Elsevier

We introduce LZ-End, a new member of the Lempel–Ziv family of text compressors, which
achieves compression ratios close to those of LZ77 but is much faster at extracting arbitrary …

Save Cite Cited by 179 Related articles All 8 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Document spanners-a brief overview of concepts, results, and recent developments

ML Schmid, N Schweikardt - Proceedings of the 41st ACM SIGMOD …, 2022 - dl.acm.org

The information extraction framework of document spanners was introduced by Fagin,
Kimelfeld, Reiss, and Vansummeren (PODS 2013, J. ACM 2015) as a formalisation of the …

Save Cite Cited by 12 Related articles All 3 versions Free GPT-4 DeepSeek

Create alert

Cite

Advanced search

Saved to My library

Application of Lempel–Ziv factorization to the approximation of grammar-based compression

Indexing highly repetitive string collections, part II: Compressed indexes

Fully functional suffix trees and optimal text searching in BWT-runs bounded space

At the roots of dictionary compression: string attractors

Collapsing the hierarchy of compressed data structures: Suffix arrays in optimal compressed space

Resolution of the burrows-wheeler transform conjecture

[BOOK][B] Genome-scale algorithm design

Optimal-time text indexing in BWT-runs bounded space

Towards a definitive measure of repetitiveness

On compressing and indexing repetitive sequences

Document spanners-a brief overview of concepts, results, and recent developments