Survey and taxonomy of lossless graph compression and space-efficient graph representations

M Besta, T Hoefler - arxiv preprint arxiv:1806.01799, 2018 - arxiv.org
Various graphs such as web or social networks may contain up to trillions of edges.
Compressing such datasets can accelerate graph processing by reducing the amount of I/O …

Computational graph pangenomics: a tutorial on data structures and their applications

JA Baaijens, P Bonizzoni, C Boucher… - Natural Computing, 2022 - Springer
Computational pangenomics is an emerging research field that is changing the way
computer scientists are facing challenges in biological sequence analysis. In past decades …

Indexing graphs for path queries with applications in genome research

J Sirén, N Välimäki, V Mäkinen - IEEE/ACM transactions on …, 2014 - ieeexplore.ieee.org
We propose a generic approach to replace the canonical sequence representation of
genomes with graph representations, and study several applications of such extensions. We …

Fully functional static and dynamic succinct trees

G Navarro, K Sadakane - ACM Transactions on Algorithms (TALG), 2014 - dl.acm.org
We propose new succinct representations of ordinal trees and match various space/time
lower bounds. It is known that any n-node static tree can be represented in 2 n+ o (n) bits so …

Succinct de Bruijn graphs

A Bowe, T Onodera, K Sadakane, T Shibuya - International workshop on …, 2012 - Springer
We propose a new succinct de Bruijn graph representation. If the de Bruijn graph of k-mers
in a DNA sequence of length N has m edges, it can be represented in 4 m+ o (m) bits. This is …

[HTML][HTML] Wheeler graphs: A framework for BWT-based data structures

T Gagie, G Manzini, J Sirén - Theoretical computer science, 2017 - Elsevier
Abstract The famous Burrows–Wheeler Transform (BWT) was originally defined for a single
string but variations have been developed for sets of strings, labeled trees, de Bruijn graphs …

On the representation of de Bruijn graphs

R Chikhi, A Limasset, S Jackman, JT Simpson… - … PA, USA, April 2-5, 2014 …, 2014 - Springer
The de Bruijn graph plays an important role in bioinformatics, especially in the context of de
novo assembly. However, the representation of the de Bruijn graph in memory is a …

Indexing variation graphs

J Sirén - 2017 Proceedings of the ninteenth workshop on …, 2017 - SIAM
Variation graphs, which represent genetic variation within a population, are replacing
sequences as reference genomes. Path indexes are one of the most important tools for …

Data Structures to Represent a Set of k-long DNA Sequences

R Chikhi, J Holub, P Medvedev - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The analysis of biological sequencing data has been one of the biggest applications of
string algorithms. The approaches used in many such applications are based on the …

Random access to grammar-compressed strings and trees

P Bille, GM Landau, R Raman, K Sadakane… - SIAM Journal on …, 2015 - SIAM
Grammar-based compression, where one replaces a long string by a small context-free
grammar that generates the string, is a simple and powerful paradigm that captures …