Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains

Abstract Background Continuous time Markov chains (CTMCs) is a widely used model for describing the evolution of DNA sequences on the nucleotide, amino acid or codon level. The sufficient statistics for CTMCs are the time spent in a state and the number...

Full description

Bibliographic Details
Main Authors:	Tataru Paula, Hobolth Asger
Format:	Article
Language:	English
Published:	BMC 2011-12-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/12/465

id	doaj-0cd9de4420284ac6b34dddd643ef137d
record_format	Article
spelling	doaj-0cd9de4420284ac6b34dddd643ef137d2020-11-25T00:33:42ZengBMCBMC Bioinformatics1471-21052011-12-0112146510.1186/1471-2105-12-465Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chainsTataru PaulaHobolth Asger<p>Abstract</p> <p>Background</p> <p>Continuous time Markov chains (CTMCs) is a widely used model for describing the evolution of DNA sequences on the nucleotide, amino acid or codon level. The sufficient statistics for CTMCs are the time spent in a state and the number of changes between any two states. In applications past evolutionary events (exact times and types of changes) are unaccessible and the past must be inferred from DNA sequence data observed in the present.</p> <p>Results</p> <p>We describe and implement three algorithms for computing linear combinations of expected values of the sufficient statistics, conditioned on the end-points of the chain, and compare their performance with respect to accuracy and running time. The first algorithm is based on an eigenvalue decomposition of the rate matrix (EVD), the second on uniformization (UNI), and the third on integrals of matrix exponentials (EXPM). The implementation in R of the algorithms is available at <url>http://www.birc.au.dk/~paula/</url>.</p> <p>Conclusions</p> <p>We use two different models to analyze the accuracy and eight experiments to investigate the speed of the three algorithms. We find that they have similar accuracy and that EXPM is the slowest method. Furthermore we find that UNI is usually faster than EVD.</p> http://www.biomedcentral.com/1471-2105/12/465
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Tataru Paula Hobolth Asger
spellingShingle	Tataru Paula Hobolth Asger Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains BMC Bioinformatics
author_facet	Tataru Paula Hobolth Asger
author_sort	Tataru Paula
title	Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains
title_short	Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains
title_full	Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains
title_fullStr	Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains
title_full_unstemmed	Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains
title_sort	comparison of methods for calculating conditional expectations of sufficient statistics for continuous time markov chains
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2011-12-01
description	<p>Abstract</p> <p>Background</p> <p>Continuous time Markov chains (CTMCs) is a widely used model for describing the evolution of DNA sequences on the nucleotide, amino acid or codon level. The sufficient statistics for CTMCs are the time spent in a state and the number of changes between any two states. In applications past evolutionary events (exact times and types of changes) are unaccessible and the past must be inferred from DNA sequence data observed in the present.</p> <p>Results</p> <p>We describe and implement three algorithms for computing linear combinations of expected values of the sufficient statistics, conditioned on the end-points of the chain, and compare their performance with respect to accuracy and running time. The first algorithm is based on an eigenvalue decomposition of the rate matrix (EVD), the second on uniformization (UNI), and the third on integrals of matrix exponentials (EXPM). The implementation in R of the algorithms is available at <url>http://www.birc.au.dk/~paula/</url>.</p> <p>Conclusions</p> <p>We use two different models to analyze the accuracy and eight experiments to investigate the speed of the three algorithms. We find that they have similar accuracy and that EXPM is the slowest method. Furthermore we find that UNI is usually faster than EVD.</p>
url	http://www.biomedcentral.com/1471-2105/12/465
work_keys_str_mv	AT tatarupaula comparisonofmethodsforcalculatingconditionalexpectationsofsufficientstatisticsforcontinuoustimemarkovchains AT hobolthasger comparisonofmethodsforcalculatingconditionalexpectationsofsufficientstatisticsforcontinuoustimemarkovchains
_version_	1725315326603689984

Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains

Similar Items