Sublinear time Lempel-Ziv (LZ77) factorization

J Ellert - International Symposium on String Processing and …, 2023 - Springer
Abstract The Lempel-Ziv (LZ77) factorization of a string is a widely-used algorithmic tool that
plays a central role in data compression and indexing. For a length-n string over integer …

[HTML][HTML] DandD: efficient measurement of sequence growth and similarity

JK Bonnie, OY Ahmed, B Langmead - Iscience, 2024 - cell.com
Genome assembly databases are growing rapidly. The redundancy of sequence content
between a new assembly and previous ones is neither conceptually nor algorithmically easy …

Pfp-fm: an accelerated FM-index

A Hong, M Oliva, D Köppl, H Bannai, C Boucher… - Algorithms for Molecular …, 2024 - Springer
FM-indexes are crucial data structures in DNA alignment, but searching with them usually
takes at least one random access per character in the query pattern. Ferragina and Fischer …

Lempel-Ziv (LZ77) factorization in sublinear time

D Kempa, T Kociumaka - 2024 IEEE 65th Annual Symposium …, 2024 - ieeexplore.ieee.org
Lempel-Ziv (LZ77) factorization is a fundamental problem in string processing: Greedily
partition a given string T from left to right into blocks (called phrases) so that each phrase is …

[PDF][PDF] Efficient string algorithmics across alphabet realms

J Ellert - 2024 - eldorado.tu-dortmund.de
Stringology is a subfield of computer science dedicated to analyzing and processing
sequences of symbols. It plays a crucial role in various applications, including lossless …

Top-k Frequent Patterns in Streams and Parameterized-Space LZ Compression

P Dinklage, J Fischer, N Prezza - 22nd International Symposium …, 2024 - drops.dagstuhl.de
We present novel online approximations of the Lempel-Ziv 77 (LZ77) and Lempel-Ziv 78
(LZ78) compression schemes [Lempel & Ziv, 1977/1978] with parameterizable space usage …

Measuring Genomic Data with PFP

Z Liptak, F Masillo, S Lucá - bioRxiv, 2025 - biorxiv.org
Prefix free parsing [Boucher et al., Alg. Mol. Biol., 2019], is a highly effective heuristic for
computing text indexes for very large amounts of biological data. The algorithm constructs a …

[PDF][PDF] Enhancing Data Compression: Recent Innovations in LZ77 Algorithms

A Hong, C Boucher - 2024 - preprints.org
The burgeoning volume of genomic data, fueled by advances in sequencing technologies,
demands efficient data compression solutions. Traditional algorithms like Lempel-Ziv77 …

[หนังสือ][B] Building Succinct Data Structures for Pangenomics

M Oliva - 2023 - search.proquest.com
With DNA sequencing becoming a routine analysis, the amount of genomic data collected is
growing rapidly, having already reached the pace, only destined to grow, of more than 35 …

Perceptions and Experiences of Undergraduate Medical Students Regarding Social Accountability: a Cross-sectional Study at a Subsaharan African Medical School

L Oriokot, IG Munabi, S Kiguli, AG Mubuuke - europepmc.org
Background Medical schools are called to be socially accountable as a feature of excellent
medical education. Medical students are essential to the development of socially …