Shortest triplet clustering: reconstructing large phylogenies using representative sets

Abstract Background Understanding the evolutionary relationships among species based on their genetic information is one of the primary objectives in phylogenetic analysis. Reconstructing phylogenies for large data sets is still a challenging task in Bi...

Full description

Bibliographic Details
Main Authors:	Sy Vinh Le, von Haeseler Arndt
Format:	Article
Language:	English
Published:	BMC 2005-04-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/6/92

id	doaj-cf10d66c266f4ee4ab1857571614a020
record_format	Article
spelling	doaj-cf10d66c266f4ee4ab1857571614a0202020-11-25T00:23:56ZengBMCBMC Bioinformatics1471-21052005-04-01619210.1186/1471-2105-6-92Shortest triplet clustering: reconstructing large phylogenies using representative setsSy Vinh Levon Haeseler Arndt<p>Abstract</p> <p>Background</p> <p>Understanding the evolutionary relationships among species based on their genetic information is one of the primary objectives in phylogenetic analysis. Reconstructing phylogenies for large data sets is still a challenging task in Bioinformatics.</p> <p>Results</p> <p>We propose a new distance-based clustering method, <it>the shortest triplet clustering algorithm (STC)</it>, to reconstruct phylogenies. The main idea is the introduction of a natural definition of so-called <it>k-representative sets</it>. Based on <it>k</it>-representative sets, <it>shortest triplets </it>are reconstructed and serve as building blocks for the STC algorithm to agglomerate sequences for tree reconstruction in <it>O</it>(<it>n</it><sup>2</sup>) time for <it>n </it>sequences.</p> <p>Simulations show that STC gives better topological accuracy than other tested methods that also build a first starting tree. STC appears as a very good method to start the tree reconstruction. However, all tested methods give similar results if balanced nearest neighbor interchange (BNNI) is applied as a post-processing step. BNNI leads to an improvement in all instances. The program is available at <url>http://www.bi.uni-duesseldorf.de/software/stc/</url>.</p> <p>Conclusion</p> <p>The results demonstrate that the new approach efficiently reconstructs phylogenies for large data sets. We found that BNNI boosts the topological accuracy of all methods including STC, therefore, one should use BNNI as a post-processing step to get better topological accuracy.</p> http://www.biomedcentral.com/1471-2105/6/92
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Sy Vinh Le von Haeseler Arndt
spellingShingle	Sy Vinh Le von Haeseler Arndt Shortest triplet clustering: reconstructing large phylogenies using representative sets BMC Bioinformatics
author_facet	Sy Vinh Le von Haeseler Arndt
author_sort	Sy Vinh Le
title	Shortest triplet clustering: reconstructing large phylogenies using representative sets
title_short	Shortest triplet clustering: reconstructing large phylogenies using representative sets
title_full	Shortest triplet clustering: reconstructing large phylogenies using representative sets
title_fullStr	Shortest triplet clustering: reconstructing large phylogenies using representative sets
title_full_unstemmed	Shortest triplet clustering: reconstructing large phylogenies using representative sets
title_sort	shortest triplet clustering: reconstructing large phylogenies using representative sets
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2005-04-01
description	<p>Abstract</p> <p>Background</p> <p>Understanding the evolutionary relationships among species based on their genetic information is one of the primary objectives in phylogenetic analysis. Reconstructing phylogenies for large data sets is still a challenging task in Bioinformatics.</p> <p>Results</p> <p>We propose a new distance-based clustering method, <it>the shortest triplet clustering algorithm (STC)</it>, to reconstruct phylogenies. The main idea is the introduction of a natural definition of so-called <it>k-representative sets</it>. Based on <it>k</it>-representative sets, <it>shortest triplets </it>are reconstructed and serve as building blocks for the STC algorithm to agglomerate sequences for tree reconstruction in <it>O</it>(<it>n</it><sup>2</sup>) time for <it>n </it>sequences.</p> <p>Simulations show that STC gives better topological accuracy than other tested methods that also build a first starting tree. STC appears as a very good method to start the tree reconstruction. However, all tested methods give similar results if balanced nearest neighbor interchange (BNNI) is applied as a post-processing step. BNNI leads to an improvement in all instances. The program is available at <url>http://www.bi.uni-duesseldorf.de/software/stc/</url>.</p> <p>Conclusion</p> <p>The results demonstrate that the new approach efficiently reconstructs phylogenies for large data sets. We found that BNNI boosts the topological accuracy of all methods including STC, therefore, one should use BNNI as a post-processing step to get better topological accuracy.</p>
url	http://www.biomedcentral.com/1471-2105/6/92
work_keys_str_mv	AT syvinhle shortesttripletclusteringreconstructinglargephylogeniesusingrepresentativesets AT vonhaeselerarndt shortesttripletclusteringreconstructinglargephylogeniesusingrepresentativesets
_version_	1725354937237372928

Shortest triplet clustering: reconstructing large phylogenies using representative sets

Similar Items