Publication View

Protein Similarity Search with Subset Seeds on a Dedicated Reconfigurable Hardware (2009)

Abstract
Abstract. With a sharp increase of available DNA and protein sequence data, new precise and fast similarity search methods are needed for largescale genome and proteome comparisons. Modern seed-based techniques of similarity search (spaced seeds, multiple seeds, subset seeds) provide a better sensitivity/specificity ratio. We present an implementation of such a seed-based technique on a parallel specialized hardware embedding reconfigurable architecture (FPGA), where the FPGA is tightly connected to large capacity Flash memories. This parallel system allows large databases to be fully indexed and rapidly accessed. Compared to traditional approaches presented by the Blastp software, we obtain both a significant speed-up and better results. To the best of our knowledge, this is the first attempt to exploit efficient seed-based algorithms for parallelizing the sequence similarity search.

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.140.3962
Source http://www.lifl.fr/~giraud/publis/peterlongo-pbc-07.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords sequence, similarity search, spaced seeds, subset seeds, indexing, FPGA
Type text
Language English
Relation 10.1.1.23.7142, 10.1.1.21.7158, 10.1.1.56.8109, 10.1.1.112.8499, 10.1.1.5.3974, 10.1.1.22.4924, 10.1.1.54.977, 10.1.1.140.4111, 10.1.1.67.9365, 10.1.1.83.4436, 10.1.1.103.422, 10.1.1.132.120, 10.1.1.60.7681, 10.1.1.87.9657, 10.1.1.106.17, 10.1.1.125.5019, 10.1.1.140.4062, 10.1.1.140.4198