MUSCLE: a multiple sequence alignment method with reduced time and space complexity

<p>Abstract</p> <p>Background</p> <p>In a previous paper, we introduced MUSCLE, a new program for creating multiple alignments of protein sequences, giving a brief summary of the algorithm and showing MUSCLE to achieve the highest scores reported to date on four alignme...

Full description

Bibliographic Details
Main Author: Edgar Robert C
Format: Article
Language:English
Published: BMC 2004-08-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/5/113
id doaj-e4d99c0bb09c45c694442b71a825ca0f
record_format Article
spelling doaj-e4d99c0bb09c45c694442b71a825ca0f2020-11-24T21:10:30ZengBMCBMC Bioinformatics1471-21052004-08-015111310.1186/1471-2105-5-113MUSCLE: a multiple sequence alignment method with reduced time and space complexityEdgar Robert C<p>Abstract</p> <p>Background</p> <p>In a previous paper, we introduced MUSCLE, a new program for creating multiple alignments of protein sequences, giving a brief summary of the algorithm and showing MUSCLE to achieve the highest scores reported to date on four alignment accuracy benchmarks. Here we present a more complete discussion of the algorithm, describing several previously unpublished techniques that improve biological accuracy and / or computational complexity. We introduce a new option, MUSCLE-fast, designed for high-throughput applications. We also describe a new protocol for evaluating objective functions that align two profiles.</p> <p>Results</p> <p>We compare the speed and accuracy of MUSCLE with CLUSTALW, Progressive POA and the MAFFT script FFTNS1, the fastest previously published program known to the author. Accuracy is measured using four benchmarks: BAliBASE, PREFAB, SABmark and SMART. We test three variants that offer highest accuracy (MUSCLE with default settings), highest speed (MUSCLE-fast), and a carefully chosen compromise between the two (MUSCLE-prog). We find MUSCLE-fast to be the fastest algorithm on all test sets, achieving average alignment accuracy similar to CLUSTALW in times that are typically two to three orders of magnitude less. MUSCLE-fast is able to align 1,000 sequences of average length 282 in 21 seconds on a current desktop computer.</p> <p>Conclusions</p> <p>MUSCLE offers a range of options that provide improved speed and / or alignment accuracy compared with currently available programs. MUSCLE is freely available at <url>http://www.drive5.com/muscle</url>.</p> http://www.biomedcentral.com/1471-2105/5/113
collection DOAJ
language English
format Article
sources DOAJ
author Edgar Robert C
spellingShingle Edgar Robert C
MUSCLE: a multiple sequence alignment method with reduced time and space complexity
BMC Bioinformatics
author_facet Edgar Robert C
author_sort Edgar Robert C
title MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_short MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_full MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_fullStr MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_full_unstemmed MUSCLE: a multiple sequence alignment method with reduced time and space complexity
title_sort muscle: a multiple sequence alignment method with reduced time and space complexity
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2004-08-01
description <p>Abstract</p> <p>Background</p> <p>In a previous paper, we introduced MUSCLE, a new program for creating multiple alignments of protein sequences, giving a brief summary of the algorithm and showing MUSCLE to achieve the highest scores reported to date on four alignment accuracy benchmarks. Here we present a more complete discussion of the algorithm, describing several previously unpublished techniques that improve biological accuracy and / or computational complexity. We introduce a new option, MUSCLE-fast, designed for high-throughput applications. We also describe a new protocol for evaluating objective functions that align two profiles.</p> <p>Results</p> <p>We compare the speed and accuracy of MUSCLE with CLUSTALW, Progressive POA and the MAFFT script FFTNS1, the fastest previously published program known to the author. Accuracy is measured using four benchmarks: BAliBASE, PREFAB, SABmark and SMART. We test three variants that offer highest accuracy (MUSCLE with default settings), highest speed (MUSCLE-fast), and a carefully chosen compromise between the two (MUSCLE-prog). We find MUSCLE-fast to be the fastest algorithm on all test sets, achieving average alignment accuracy similar to CLUSTALW in times that are typically two to three orders of magnitude less. MUSCLE-fast is able to align 1,000 sequences of average length 282 in 21 seconds on a current desktop computer.</p> <p>Conclusions</p> <p>MUSCLE offers a range of options that provide improved speed and / or alignment accuracy compared with currently available programs. MUSCLE is freely available at <url>http://www.drive5.com/muscle</url>.</p>
url http://www.biomedcentral.com/1471-2105/5/113
work_keys_str_mv AT edgarrobertc muscleamultiplesequencealignmentmethodwithreducedtimeandspacecomplexity
_version_ 1716756335016542208