MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data

Abstract Background Large nucleotide sequence datasets are becoming increasingly common objects of comparison. Complete bacterial genomes are reported almost everyday. This creates challenges for developing new multiple sequence alignment methods. Conve...

Full description

Bibliographic Details
Main Authors:	Kryukov Kirill, Saitou Naruya
Format:	Article
Language:	English
Published:	BMC 2010-03-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/11/142

id	doaj-fe323734ed7048cd89fdc0abb7679e1b
record_format	Article
spelling	doaj-fe323734ed7048cd89fdc0abb7679e1b2020-11-25T00:55:06ZengBMCBMC Bioinformatics1471-21052010-03-0111114210.1186/1471-2105-11-142MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale dataKryukov KirillSaitou Naruya<p>Abstract</p> <p>Background</p> <p>Large nucleotide sequence datasets are becoming increasingly common objects of comparison. Complete bacterial genomes are reported almost everyday. This creates challenges for developing new multiple sequence alignment methods. Conventional multiple alignment methods are based on pairwise alignment and/or progressive alignment techniques. These approaches have performance problems when the number of sequences is large and when dealing with genome scale sequences.</p> <p>Results</p> <p>We present a new method of multiple sequence alignment, called MISHIMA (Method for Inferring Sequence History In terms of Multiple Alignment), that does not depend on pairwise sequence comparison. A new algorithm is used to quickly find rare oligonucleotide sequences shared by all sequences. Divide and conquer approach is then applied to break the sequences into fragments that can be aligned independently by an external alignment program. These partial alignments are assembled together to form a complete alignment of the original sequences.</p> <p>Conclusions</p> <p>MISHIMA provides improved performance compared to the commonly used multiple alignment methods. As an example, six complete genome sequences of bacteria species <it>Helicobacter pylori </it>(about 1.7 Mb each) were successfully aligned in about 6 hours using a single PC.</p> http://www.biomedcentral.com/1471-2105/11/142
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Kryukov Kirill Saitou Naruya
spellingShingle	Kryukov Kirill Saitou Naruya MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data BMC Bioinformatics
author_facet	Kryukov Kirill Saitou Naruya
author_sort	Kryukov Kirill
title	MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data
title_short	MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data
title_full	MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data
title_fullStr	MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data
title_full_unstemmed	MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data
title_sort	mishima - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2010-03-01
description	<p>Abstract</p> <p>Background</p> <p>Large nucleotide sequence datasets are becoming increasingly common objects of comparison. Complete bacterial genomes are reported almost everyday. This creates challenges for developing new multiple sequence alignment methods. Conventional multiple alignment methods are based on pairwise alignment and/or progressive alignment techniques. These approaches have performance problems when the number of sequences is large and when dealing with genome scale sequences.</p> <p>Results</p> <p>We present a new method of multiple sequence alignment, called MISHIMA (Method for Inferring Sequence History In terms of Multiple Alignment), that does not depend on pairwise sequence comparison. A new algorithm is used to quickly find rare oligonucleotide sequences shared by all sequences. Divide and conquer approach is then applied to break the sequences into fragments that can be aligned independently by an external alignment program. These partial alignments are assembled together to form a complete alignment of the original sequences.</p> <p>Conclusions</p> <p>MISHIMA provides improved performance compared to the commonly used multiple alignment methods. As an example, six complete genome sequences of bacteria species <it>Helicobacter pylori </it>(about 1.7 Mb each) were successfully aligned in about 6 hours using a single PC.</p>
url	http://www.biomedcentral.com/1471-2105/11/142
work_keys_str_mv	AT kryukovkirill mishimaanewmethodforhighspeedmultiplealignmentofnucleotidesequencesofbacterialgenomescaledata AT saitounaruya mishimaanewmethodforhighspeedmultiplealignmentofnucleotidesequencesofbacterialgenomescaledata
_version_	1725232015482028032

MISHIMA - a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data

Similar Items