Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Indexing highly repetitive string collections, part II: compressed indexes
G Navarro - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Two decades ago, a breakthrough in indexing string collections made it possible to
represent them within their compressed space while at the same time offering indexed …
represent them within their compressed space while at the same time offering indexed …
[หนังสือ][B] The Logical Approach to Automatic Sequences
J Shallit - 2022 - books.google.com
Automatic sequences are sequences over a finite alphabet generated by a finite-state
machine. This book presents a novel viewpoint on automatic sequences, and more …
machine. This book presents a novel viewpoint on automatic sequences, and more …
Resolution of the burrows-wheeler transform conjecture
Abstract The Burrows-Wheeler Transform (BWT) is an invertible text transformation that
permutes symbols of a text according to the lexicographical order of its suffixes. BWT is the …
permutes symbols of a text according to the lexicographical order of its suffixes. BWT is the …
Collapsing the hierarchy of compressed data structures: Suffix arrays in optimal compressed space
The last two decades have witnessed a dramatic increase in the amount of highly repetitive
datasets consisting of sequential data (strings, texts). Processing these massive amounts of …
datasets consisting of sequential data (strings, texts). Processing these massive amounts of …
Optimal-time dictionary-compressed indexes
We describe the first self-indexes able to count and locate pattern occurrences in optimal
time within a space bounded by the size of the most popular dictionary compressors. To …
time within a space bounded by the size of the most popular dictionary compressors. To …
Internal pattern matching queries in a text and applications
We consider several types of internal queries, that is, questions about fragments of a given
text specified in constant space by their locations in. Our main result is an optimal data …
text specified in constant space by their locations in. Our main result is an optimal data …
Sensitivity of string compressors and repetitiveness measures
The sensitivity of a string compression algorithm C asks how much the output size C (T) for
an input string T can increase when a single character edit operation is performed on T. This …
an input string T can increase when a single character edit operation is performed on T. This …
Toward a definitive compressibility measure for repetitive sequences
While the th order empirical entropy is an accepted measure of the compressibility of
individual sequences on classical text collections, it is useful only for small values of and …
individual sequences on classical text collections, it is useful only for small values of and …
An upper bound and linear-space queries on the LZ-end parsing
Lempel–Ziv (LZ77) compression is the most commonly used lossless compression
algorithm. The basic idea is to greedily break the input string into blocks (called “phrases”) …
algorithm. The basic idea is to greedily break the input string into blocks (called “phrases”) …
Sigmoni: classification of nanopore signal with a compressed pangenome index
Improvements in nanopore sequencing necessitate efficient classification methods,
including pre-filtering and adaptive sampling algorithms that enrich for reads of interest …
including pre-filtering and adaptive sampling algorithms that enrich for reads of interest …