Indexing highly repetitive string collections, part II: compressed indexes
G Navarro - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Two decades ago, a breakthrough in indexing string collections made it possible to
represent them within their compressed space while at the same time offering indexed …
Fully functional suffix trees and optimal text searching in BWT-runs bounded space
Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …
Optimal-time text indexing in BWT-runs bounded space
Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …
Text indexing for long patterns: Anchors are all you need
In many real-world database systems, a large fraction of the data is represented by strings:
sequences of letters over some alphabet. This is because strings can easily encode data …
Wheeler maps
Motivated by challenges in pangenomic read alignment, we propose a generalization of
Wheeler graphs that we call Wheeler maps. A Wheeler map stores a text T[1..n] and an …
Gapped indexing for consecutive occurrences
The classic string indexing problem is to preprocess a string S into a compact data structure
that supports efficient pattern matching queries. Typical queries include existential queries …
Indexing weighted sequences: Neat and efficient
A weighted sequence is a sequence of probability mass functions over a finite alphabet. A
weighted index is a data structure constructed for a weighted sequence and a threshold 1/z …
Breaking the 𝒪(n)-Barrier in the Construction of Compressed Suffix Arrays and Suffix Trees
The suffix array, describing the lexicographical order of suffixes of a given text, and the suffix
tree, a path-compressed trie of all suffixes, are the two most fundamental data structures for …
Indexing highly repetitive string collections
G Navarro - arXiv preprint arXiv:2004.02781, 2020 - arxiv.org
Two decades ago, a breakthrough in indexing string collections made it possible to
represent them within their compressed space while at the same time offering indexed …
Gapped string indexing in subquadratic space and sublinear query time
In Gapped String Indexing, the goal is to compactly represent a string $S$ of length $n$
such that for any query consisting of two strings $P_1$ and $P_2$, called patterns, and an …