Protein database searches using compositionally adjusted substitution matrices

SF Altschul, JC Wootton, EM Gertz… - The FEBS …, 2005 - Wiley Online Library
Almost all protein database search methods use amino acid substitution matrices for
scoring, optimizing, and assessing the statistical significance of sequence alignments. Much …

Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements

AA Schäffer, L Aravind, TL Madden… - Nucleic acids …, 2001 - academic.oup.com
PSI-BLAST is an iterative program to search a database for proteins with distant similarity to
a query sequence. We investigated over a dozen modifications to the methods used in PSI …

Accelerated profile HMM searches

SR Eddy - PLoS computational biology, 2011 - journals.plos.org
Profile hidden Markov models (profile HMMs) and probabilistic inference methods have
made important contributions to the theory of sequence database homology search …

A new generation of homology search tools based on probabilistic inference

SR Eddy - Genome Informatics 2009: Genome Informatics Series …, 2009 - World Scientific
Many theoretical advances have been made in applying probabilistic inference methods to
improve the power of sequence homology searches, yet the BLAST suite of programs is still …

Discovering microRNAs from deep sequencing data using miRDeep

MR Friedländer, W Chen, C Adamidi, J Maaskola… - Nature …, 2008 - nature.com
The capacity of highly parallel sequencing technologies to detect small RNAs at
unprecedented depth suggests their value in systematically identifying microRNAs …

A probabilistic model of local sequence alignment that simplifies statistical significance estimation

SR Eddy - PLoS computational biology, 2008 - journals.plos.org
Sequence database searches require accurate estimation of the statistical significance of
scores. Optimal local sequence alignment scores follow Gumbel distributions, but …

[BOOK][B] Blast

I Korf, M Yandell, J Bedell - 2003 - books.google.com
Sequence similarity is a powerful tool for discovering biological function. Just as the ancient
Greeks used comparative anatomy to understand the human body and linguists used the …

The CATH database: an extended protein family resource for structural and functional genomics

FMG Pearl, CF Bennett, JE Bray… - Nucleic acids …, 2003 - academic.oup.com
The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath_new)
currently contains 34 287 domain structures classified into 1383 …

COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance

R Sadreyev, N Grishin - Journal of molecular biology, 2003 - Elsevier
We present a novel method for the comparison of multiple protein alignments with
assessment of statistical significance (COMPASS). The method derives numerical profiles …

RSEARCH: finding homologs of single structured RNA sequences

RJ Klein, SR Eddy - BMC bioinformatics, 2003 - Springer
Background: For many RNA molecules, secondary structure rather than primary sequence is
the evolutionarily conserved feature. No programs have yet been published that allow …