Fast estimation of the difference between two PAM/JTT evolutionary distances in triplets of homologous sequences

Abstract

Background: The estimation of the difference between two evolutionary distances within a triplet of homologs is a common operation that is used, for example, to determine which of two sequences is closer to a third one. The most accurate method is...


Bibliographic Details
Main Authors: Schneider Adrian, Gil Manuel, Dessimoz Christophe, Gonnet Gaston H
Format: Article
Language: English
Published: BMC 2006-12-01
Series: BMC Bioinformatics
Online Access: http://www.biomedcentral.com/1471-2105/7/529
id doaj-bc0e60d08d8e406cbf3adff1adf7da11
record_format Article
spelling doaj-bc0e60d08d8e406cbf3adff1adf7da11; 2020-11-24T23:56:30Z; eng; BMC; BMC Bioinformatics; 1471-2105; 2006-12-01; 7; 1; 529; 10.1186/1471-2105-7-529
collection DOAJ
sources DOAJ
issn 1471-2105
description Abstract
Background: The estimation of the difference between two evolutionary distances within a triplet of homologs is a common operation that is used, for example, to determine which of two sequences is closer to a third one. The most accurate method is currently maximum likelihood over the entire triplet. However, this approach is relatively time-consuming.
Results: We show that an alternative estimator, based on pairwise estimates and therefore much faster to compute, has almost the same statistical power as the maximum likelihood estimator. We also provide a numerical approximation for its variance, which could otherwise only be estimated through an expensive re-sampling approach such as bootstrapping. An extensive simulation demonstrates that the approximation delivers precise confidence intervals. To illustrate the possible applications of these results, we show how they improve the detection of asymmetric evolution and the identification of the closest relative to a given sequence in a group of homologs.
Conclusion: The results presented in this paper constitute a basis for large-scale protein cross-comparisons of pairwise evolutionary distances.