Publication View

184 Genome Informatics 12: 184–193 (2001) A Mini-Greedy Algorithm for Faster Structural RNA Stem-Loop Search (2008)

Abstract
When a set of coregulated genes share a common structural RNA motif, e.g. a hairpin, most motif search approaches fail to locate the covarying but structurally conserved motif. There do exist methods that can locate structural RNA motifs, like FOLDALIGN, but the main problem with these methods is that they are computationally expensive. In FOLDALIGN, a major contribution to this is the use of a greedy algorithm to construct the multiple alignment. To ensure good quality many redundant computations must be made. However, by applying the greedy algorithm on a carefully selected subset of sequences, near full greedy quality can be obtained. The basic idea is to estimate the order in which the sequences entered a good greedy alignment. If such a ranking, found from all pairwise alignments, is in good agreement with the order of appearance in the multiple alignment, the core structural motif can be found by performing the greedy algorithm on just the top sequences in the ranking. The ranking used in this mini-greedy algorithm is found by using two complementing approaches: 1) When interpreting the FOLDALIGN score as an inner product (kernel), the sequences can be ranked according to their distance to their center of mass; 2) We construct an algorithm that attempts to find the K closest sequences in the vector space associated

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.96.1406
Source http://www.jsbi.org/journal/GIW01/GIW01F19.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords RNA Structural alignment, greedy algor
Type text
Language English
Relation 10.1.1.21.2820, 10.1.1.9.201, 10.1.1.53.3629, 10.1.1.40.4778, 10.1.1.140.7684