NBLAST: a cluster variant of BLAST for NxN comparisons

<p>Abstract</p> <p>Background</p> <p>The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally...

Full description

Bibliographic Details
Main Authors: Dumontier Michel, Hogue Christopher WV
Format: Article
Language:English
Published: BMC 2002-05-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/3/13
id doaj-5c53e17587fe47628156e7fc57cde04a
record_format Article
spelling doaj-5c53e17587fe47628156e7fc57cde04a2020-11-25T02:09:28ZengBMCBMC Bioinformatics1471-21052002-05-01311310.1186/1471-2105-3-13NBLAST: a cluster variant of BLAST for NxN comparisonsDumontier MichelHogue Christopher WV<p>Abstract</p> <p>Background</p> <p>The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally intensive task. We developed NBLAST as a cluster computer implementation of the BLAST family of sequence comparison programs for the purpose of generating pre-computed BLAST alignments and neighbour lists of NR sequences.</p> <p>Results</p> <p>NBLAST performs the heuristic BLAST algorithm and generates an exhaustive database of alignments, but it only computes <graphic file="1471-2105-3-13-i1.gif"/> alignments (i.e. the upper triangle) of a possible N<sup>2</sup> alignments, where N is the set of all sequences to be compared. A task-partitioning algorithm allows for cluster computing across all cluster nodes and the NBLAST master process produces a BLAST sequence alignment database and a list of sequence neighbours for each sequence record. The resulting sequence alignment and neighbour databases are used to serve the SeqHound query system through a C/C++ and PERL Application Programming Interface (API).</p> <p>Conclusions</p> <p>NBLAST offers a local alternative to the NCBI's remote Entrez system for pre-computed BLAST alignments and neighbour queries. On our 216-processor 450 MHz PIII cluster, NBLAST requires ~24 hrs to compute neighbours for 850000 proteins currently in the non-redundant protein database.</p> http://www.biomedcentral.com/1471-2105/3/13
collection DOAJ
language English
format Article
sources DOAJ
author Dumontier Michel
Hogue Christopher WV
spellingShingle Dumontier Michel
Hogue Christopher WV
NBLAST: a cluster variant of BLAST for NxN comparisons
BMC Bioinformatics
author_facet Dumontier Michel
Hogue Christopher WV
author_sort Dumontier Michel
title NBLAST: a cluster variant of BLAST for NxN comparisons
title_short NBLAST: a cluster variant of BLAST for NxN comparisons
title_full NBLAST: a cluster variant of BLAST for NxN comparisons
title_fullStr NBLAST: a cluster variant of BLAST for NxN comparisons
title_full_unstemmed NBLAST: a cluster variant of BLAST for NxN comparisons
title_sort nblast: a cluster variant of blast for nxn comparisons
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2002-05-01
description <p>Abstract</p> <p>Background</p> <p>The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally intensive task. We developed NBLAST as a cluster computer implementation of the BLAST family of sequence comparison programs for the purpose of generating pre-computed BLAST alignments and neighbour lists of NR sequences.</p> <p>Results</p> <p>NBLAST performs the heuristic BLAST algorithm and generates an exhaustive database of alignments, but it only computes <graphic file="1471-2105-3-13-i1.gif"/> alignments (i.e. the upper triangle) of a possible N<sup>2</sup> alignments, where N is the set of all sequences to be compared. A task-partitioning algorithm allows for cluster computing across all cluster nodes and the NBLAST master process produces a BLAST sequence alignment database and a list of sequence neighbours for each sequence record. The resulting sequence alignment and neighbour databases are used to serve the SeqHound query system through a C/C++ and PERL Application Programming Interface (API).</p> <p>Conclusions</p> <p>NBLAST offers a local alternative to the NCBI's remote Entrez system for pre-computed BLAST alignments and neighbour queries. On our 216-processor 450 MHz PIII cluster, NBLAST requires ~24 hrs to compute neighbours for 850000 proteins currently in the non-redundant protein database.</p>
url http://www.biomedcentral.com/1471-2105/3/13
work_keys_str_mv AT dumontiermichel nblastaclustervariantofblastfornxncomparisons
AT hoguechristopherwv nblastaclustervariantofblastfornxncomparisons
_version_ 1724923623416791040