[BOOK][B] Handbook of computational molecular biology
S Aluru - 2005 - taylorfrancis.com
The enormous complexity of biological systems at the molecular level must be answered
with powerful computational methods. Computational biology is a young field, but has seen …
with powerful computational methods. Computational biology is a young field, but has seen …
On the performance of bitmap indices for high cardinality attributes
It is well established that bitmap indices are efficient for read-only attributes with low attribute
cardinalities. For an attribute with a high cardinality, the size of the bitmap index can be very …
cardinalities. For an attribute with a high cardinality, the size of the bitmap index can be very …
Bio-strings: a relational database data-type for dealing with large biosequences
DNA sequencers output a large set of very long biological data strings that we should persist
in databases rather than basic text file systems. Many different data models and database …
in databases rather than basic text file systems. Many different data models and database …
Practical methods for constructing suffix trees
Sequence datasets are ubiquitous in modern life-science applications, and querying
sequences is a common and critical operation in many of these applications. The suffix tree …
sequences is a common and critical operation in many of these applications. The suffix tree …
Oasis: An online and accurate technique for local-alignment searches on biological sequences
C Meek, JM Patel, S Kasetty - Proceedings 2003 VLDB Conference, 2003 - Elsevier
Publisher Summary This chapter introduces a new algorithm called online and accurate
search technique for Inferring local-alignments on sequences (OASIS), which improves …
search technique for Inferring local-alignments on sequences (OASIS), which improves …
Genome-scale disk-based suffix tree indexing
With the exponential growth of biological sequence databases, it has become critical to
develop effective techniques for storing, querying, and analyzing these massive data. Suffix …
develop effective techniques for storing, querying, and analyzing these massive data. Suffix …
Efficient implementation of lazy suffix trees
We present an efficient implementation of a write‐only top‐down construction for suffix trees.
Our implementation is based on a new, space‐efficient representation of suffix trees that …
Our implementation is based on a new, space‐efficient representation of suffix trees that …
Practical suffix tree construction
Large string datasets are common in a number of emerging text and biological database
applications. Common queries over such datasets include both exact and approximate …
applications. Common queries over such datasets include both exact and approximate …
A space and time efficient algorithm for constructing compressed suffix arrays
With the first human DNA being decoded into a sequence of about 2.8 billion characters,
much biological research has been centered on analyzing this sequence. Theoretically …
much biological research has been centered on analyzing this sequence. Theoretically …
Constructing suffix tree for gigabyte sequences with megabyte memory
CF Cheung, JX Yu, H Lu - IEEE Transactions on Knowledge …, 2005 - ieeexplore.ieee.org
Mammalian genomes are typically 3 Gbps (gibabase pairs) in size. The largest public
database NCBI (National Center for Biotechnology Information (http://www. ncbi. nlm. nih …
database NCBI (National Center for Biotechnology Information (http://www. ncbi. nlm. nih …