Computational pan-genomics: status, promises and challenges

Briefings in bioinformatics, 2018 - academic.oup.com
Many disciplines, from human genetics and oncology to plant breeding, microbiology and
virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes …

Indexing highly repetitive string collections, part II: compressed indexes

G Navarro - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Two decades ago, a breakthrough in indexing string collections made it possible to
represent them within their compressed space while at the same time offering indexed …

LoRDEC: accurate and efficient long read error correction

L Salmela, E Rivals - Bioinformatics, 2014 - academic.oup.com
Motivation: PacBio single molecule real-time sequencing is a third-generation sequencing
technique producing long reads, with comparatively lower throughput and higher error rate …

[BOOK][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

Fully functional suffix trees and optimal text searching in BWT-runs bounded space

T Gagie, G Navarro, N Prezza - Journal of the ACM (JACM), 2020 - dl.acm.org
Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …

[BOOK][B] The algorithm design manual

SS Skiena - 2008 - Springer
This newly expanded and updated second edition of the best-selling classic continues to
take the "mystery" out of designing algorithms, and analyzing their efficacy and efficiency …

Indexing graphs for path queries with applications in genome research

J Sirén, N Välimäki, V Mäkinen - IEEE/ACM transactions on …, 2014 - ieeexplore.ieee.org
We propose a generic approach to replace the canonical sequence representation of
genomes with graph representations, and study several applications of such extensions. We …

Succinct indexable dictionaries with applications to encoding k-ary trees, prefix sums and multisets

R Raman, V Raman, SR Satti - ACM Transactions on Algorithms (TALG), 2007 - dl.acm.org
We consider the indexable dictionary problem, which consists of storing a set S ⊆ {0, …, m−1}
for some integer m while supporting the operations of rank(x), which returns the number …

Wavelet trees for all

G Navarro - Journal of Discrete Algorithms, 2014 - Elsevier
The wavelet tree is a versatile data structure that serves a number of purposes, from string
processing to computational geometry. It can be regarded as a device that represents a …

A taxonomy of suffix array construction algorithms

SJ Puglisi, WF Smyth, AH Turpin - ACM Computing Surveys (CSUR), 2007 - dl.acm.org
In 1990, Manber and Myers proposed suffix arrays as a space-saving alternative to suffix
trees and described the first algorithms for suffix array construction and use. Since that time …