Compressed full-text indexes

G Navarro, V Mäkinen - ACM Computing Surveys (CSUR), 2007 - dl.acm.org
Full-text indexes provide fast substring search over large text collections. A serious problem
of these indexes has traditionally been their space consumption. A recent trend is to develop …

A taxonomy of suffix array construction algorithms

SJ Puglisi, WF Smyth, AH Turpin - acm Computing Surveys (CSUR), 2007 - dl.acm.org
In 1990, Manber and Myers proposed suffix arrays as a space-saving alternative to suffix
trees and described the first algorithms for suffix array construction and use. Since that time …

Linear work suffix array construction

J Kärkkäinen, P Sanders, S Burkhardt - Journal of the ACM (JACM), 2006 - dl.acm.org
Suffix trees and suffix arrays are widely used and largely interchangeable index structures
on strings and sequences. Practitioners prefer suffix arrays due to their simplicity and space …

Hamsa: Fast signature generation for zero-day polymorphic worms with provable attack resilience

Z Li, M Sanghi, Y Chen, MY Kao… - 2006 IEEE Symposium …, 2006 - ieeexplore.ieee.org
Zero-day polymorphic worms pose a serious threat to the security of Internet infrastructures.
Given their rapid propagation, it is crucial to detect them at edge networks and automatically …

Two efficient algorithms for linear time suffix array construction

G Nong, S Zhang, WH Chan - IEEE transactions on computers, 2010 - ieeexplore.ieee.org
We present, in this paper, two efficient algorithms for linear time suffix array construction.
These two algorithms achieve their linear time complexities, using the techniques of divide …

[書籍][B] Handbook of computational molecular biology

S Aluru - 2005 - taylorfrancis.com
The enormous complexity of biological systems at the molecular level must be answered
with powerful computational methods. Computational biology is a young field, but has seen …

Linear suffix array construction by almost pure induced-sorting

G Nong, S Zhang, WH Chan - 2009 data compression …, 2009 - ieeexplore.ieee.org
We present a linear time and space suffix array (SA) construction algorithm called the SA-IS
algorithm. The SA-IS algorithm is novel because of the LMS-substrings used for the problem …

Compressed text indexes: From theory to practice

P Ferragina, R González, G Navarro… - Journal of Experimental …, 2009 - dl.acm.org
A compressed full-text self-index represents a text in a compressed form and still answers
queries efficiently. This represents a significant advancement over the (full-) text indexing …

A survey on zero-day polymorphic worm detection techniques

R Kaur, M Singh - IEEE Communications Surveys & Tutorials, 2014 - ieeexplore.ieee.org
Zero-day polymorphic worms pose a serious threat to the Internet security. With their ability
to rapidly propagate, these worms increasingly threaten the Internet hosts and services. Not …

[PDF][PDF] Linear-Time Computation of Similarity Measures for Sequential Data.

K Rieck, P Laskov - Journal of Machine Learning Research, 2008 - jmlr.org
Efficient and expressive comparison of sequences is an essential procedure for learning
with sequential data. In this article we propose a generic framework for computation of …