[BOOK][B] Handbook of computational molecular biology

S Aluru - 2005 - taylorfrancis.com
The enormous complexity of biological systems at the molecular level must be answered
with powerful computational methods. Computational biology is a young field, but has seen …

On the performance of bitmap indices for high cardinality attributes

K Wu, E Otoo, A Shoshani - VLDB, 2004 - books.google.com
It is well established that bitmap indices are efficient for read-only attributes with low attribute
cardinalities. For an attribute with a high cardinality, the size of the bitmap index can be very …

Bio-strings: a relational database data-type for dealing with large biosequences

S Lifschitz, EH Haeusler, M Catanho, AB de Miranda… - BioTech, 2022 - mdpi.com
DNA sequencers output a large set of very long biological data strings that we should persist
in databases rather than basic text file systems. Many different data models and database …

Practical methods for constructing suffix trees

Y Tian, S Tata, RA Hankins, JM Patel - The VLDB Journal, 2005 - Springer
Sequence datasets are ubiquitous in modern life-science applications, and querying
sequences is a common and critical operation in many of these applications. The suffix tree …

Oasis: An online and accurate technique for local-alignment searches on biological sequences

C Meek, JM Patel, S Kasetty - Proceedings 2003 VLDB Conference, 2003 - Elsevier
Publisher Summary This chapter introduces a new algorithm called online and accurate
search technique for Inferring local-alignments on sequences (OASIS), which improves …

Genome-scale disk-based suffix tree indexing

B Phoophakdee, MJ Zaki - Proceedings of the 2007 ACM SIGMOD …, 2007 - dl.acm.org
With the exponential growth of biological sequence databases, it has become critical to
develop effective techniques for storing, querying, and analyzing these massive data. Suffix …

Efficient implementation of lazy suffix trees

R Giegerich, S Kurtz, J Stoye - Software: Practice and …, 2003 - Wiley Online Library
We present an efficient implementation of a write‐only top‐down construction for suffix trees.
Our implementation is based on a new, space‐efficient representation of suffix trees that …

Practical suffix tree construction

S Tata, RA Hankins, JM Patel - VLDB, 2004 - books.google.com
Large string datasets are common in a number of emerging text and biological database
applications. Common queries over such datasets include both exact and approximate …

A space and time efficient algorithm for constructing compressed suffix arrays

WK Hon, TW Lam, K Sadakane, WK Sung, SM Yiu - Algorithmica, 2007 - Springer
With the first human DNA being decoded into a sequence of about 2.8 billion characters,
much biological research has been centered on analyzing this sequence. Theoretically …

Constructing suffix tree for gigabyte sequences with megabyte memory

CF Cheung, JX Yu, H Lu - IEEE Transactions on Knowledge …, 2005 - ieeexplore.ieee.org
Mammalian genomes are typically 3 Gbps (gibabase pairs) in size. The largest public
database NCBI (National Center for Biotechnology Information (http://www. ncbi. nlm. nih …