Statistical significance and extremal ensemble of gapped local hybrid alignment

  • Yu Y
  • Bundschuh R
  • Hwa T
N/ACitations
Citations of this article
22Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A "semi-probabilistic" alignment algorithm which combines ideas from Smith-Waterman and probabilistic alignment is proposed and studied in detail. It is predicted that the score statistics of this "hybrid" algorithm is of the universal Gumbel form, with the key Gumbel parameter λ taking on a fixed asymptotic value for a wide variety of scoring parameters. We have also characterized the "extremal ensemble", i.e., the collection of sequence pairs exhibiting similarities that a given scoring system is most sensitive to. Based on this extremal ensemble, a simple recipe for the computation of the "relative entropy", and from it the correction to λ due to finite sequence length is also given. This allows us to assign p-values to the alignment results for arbitrary scoring parameters and gap costs. The predictions compare well with direct numerical simulations for a broad range of sequence lengths with various choices of the substitution scores and affine gap parameters.

Cite

CITATION STYLE

APA

Yu, Y.-K., Bundschuh, R., & Hwa, T. (2007). Statistical significance and extremal ensemble of gapped local hybrid alignment. In Biological Evolution and Statistical Physics (pp. 3–21). Springer Berlin Heidelberg. https://doi.org/10.1007/3-540-45692-9_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free