NBLAST: a cluster variant of BLAST for NxN comparisons
<p>Abstract</p> <p>Background</p> <p>The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2002-05-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/3/13 |
id |
doaj-5c53e17587fe47628156e7fc57cde04a |
---|---|
record_format |
Article |
spelling |
doaj-5c53e17587fe47628156e7fc57cde04a2020-11-25T02:09:28ZengBMCBMC Bioinformatics1471-21052002-05-01311310.1186/1471-2105-3-13NBLAST: a cluster variant of BLAST for NxN comparisonsDumontier MichelHogue Christopher WV<p>Abstract</p> <p>Background</p> <p>The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally intensive task. We developed NBLAST as a cluster computer implementation of the BLAST family of sequence comparison programs for the purpose of generating pre-computed BLAST alignments and neighbour lists of NR sequences.</p> <p>Results</p> <p>NBLAST performs the heuristic BLAST algorithm and generates an exhaustive database of alignments, but it only computes <graphic file="1471-2105-3-13-i1.gif"/> alignments (i.e. the upper triangle) of a possible N<sup>2</sup> alignments, where N is the set of all sequences to be compared. A task-partitioning algorithm allows for cluster computing across all cluster nodes and the NBLAST master process produces a BLAST sequence alignment database and a list of sequence neighbours for each sequence record. The resulting sequence alignment and neighbour databases are used to serve the SeqHound query system through a C/C++ and PERL Application Programming Interface (API).</p> <p>Conclusions</p> <p>NBLAST offers a local alternative to the NCBI's remote Entrez system for pre-computed BLAST alignments and neighbour queries. On our 216-processor 450 MHz PIII cluster, NBLAST requires ~24 hrs to compute neighbours for 850000 proteins currently in the non-redundant protein database.</p> http://www.biomedcentral.com/1471-2105/3/13 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Dumontier Michel Hogue Christopher WV |
spellingShingle |
Dumontier Michel Hogue Christopher WV NBLAST: a cluster variant of BLAST for NxN comparisons BMC Bioinformatics |
author_facet |
Dumontier Michel Hogue Christopher WV |
author_sort |
Dumontier Michel |
title |
NBLAST: a cluster variant of BLAST for NxN comparisons |
title_short |
NBLAST: a cluster variant of BLAST for NxN comparisons |
title_full |
NBLAST: a cluster variant of BLAST for NxN comparisons |
title_fullStr |
NBLAST: a cluster variant of BLAST for NxN comparisons |
title_full_unstemmed |
NBLAST: a cluster variant of BLAST for NxN comparisons |
title_sort |
nblast: a cluster variant of blast for nxn comparisons |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2002-05-01 |
description |
<p>Abstract</p> <p>Background</p> <p>The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally intensive task. We developed NBLAST as a cluster computer implementation of the BLAST family of sequence comparison programs for the purpose of generating pre-computed BLAST alignments and neighbour lists of NR sequences.</p> <p>Results</p> <p>NBLAST performs the heuristic BLAST algorithm and generates an exhaustive database of alignments, but it only computes <graphic file="1471-2105-3-13-i1.gif"/> alignments (i.e. the upper triangle) of a possible N<sup>2</sup> alignments, where N is the set of all sequences to be compared. A task-partitioning algorithm allows for cluster computing across all cluster nodes and the NBLAST master process produces a BLAST sequence alignment database and a list of sequence neighbours for each sequence record. The resulting sequence alignment and neighbour databases are used to serve the SeqHound query system through a C/C++ and PERL Application Programming Interface (API).</p> <p>Conclusions</p> <p>NBLAST offers a local alternative to the NCBI's remote Entrez system for pre-computed BLAST alignments and neighbour queries. On our 216-processor 450 MHz PIII cluster, NBLAST requires ~24 hrs to compute neighbours for 850000 proteins currently in the non-redundant protein database.</p> |
url |
http://www.biomedcentral.com/1471-2105/3/13 |
work_keys_str_mv |
AT dumontiermichel nblastaclustervariantofblastfornxncomparisons AT hoguechristopherwv nblastaclustervariantofblastfornxncomparisons |
_version_ |
1724923623416791040 |