NBLAST: a cluster variant of BLAST for NxN comparisons

Abstract Background The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally...

Full description

Bibliographic Details
Main Authors:	Dumontier Michel, Hogue Christopher WV
Format:	Article
Language:	English
Published:	BMC 2002-05-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/3/13

Description
Summary:	<p>Abstract</p> <p>Background</p> <p>The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally intensive task. We developed NBLAST as a cluster computer implementation of the BLAST family of sequence comparison programs for the purpose of generating pre-computed BLAST alignments and neighbour lists of NR sequences.</p> <p>Results</p> <p>NBLAST performs the heuristic BLAST algorithm and generates an exhaustive database of alignments, but it only computes <graphic file="1471-2105-3-13-i1.gif"/> alignments (i.e. the upper triangle) of a possible N<sup>2</sup> alignments, where N is the set of all sequences to be compared. A task-partitioning algorithm allows for cluster computing across all cluster nodes and the NBLAST master process produces a BLAST sequence alignment database and a list of sequence neighbours for each sequence record. The resulting sequence alignment and neighbour databases are used to serve the SeqHound query system through a C/C++ and PERL Application Programming Interface (API).</p> <p>Conclusions</p> <p>NBLAST offers a local alternative to the NCBI's remote Entrez system for pre-computed BLAST alignments and neighbour queries. On our 216-processor 450 MHz PIII cluster, NBLAST requires ~24 hrs to compute neighbours for 850000 proteins currently in the non-redundant protein database.</p>
ISSN:	1471-2105

NBLAST: a cluster variant of BLAST for NxN comparisons

Similar Items