Benchmarking multi-rate codon models.

The single rate codon model of non-synonymous substitution is ubiquitous in phylogenetic modeling. Indeed, the use of a non-synonymous to synonymous substitution rate ratio parameter has facilitated the interpretation of selection pressure on genomes. Although the single rate model has achieved wide...

Full description

Bibliographic Details
Main Authors:	Wayne Delport, Konrad Scheffler, Mike B Gravenor, Spencer V Muse, Sergei Kosakovsky Pond
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2010-07-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC2908124?pdf=render

id	doaj-c69d6af27dce42fbb0ac9813aa18aaab
record_format	Article
spelling	doaj-c69d6af27dce42fbb0ac9813aa18aaab2020-11-25T01:55:54ZengPublic Library of Science (PLoS)PLoS ONE1932-62032010-07-0157e1158710.1371/journal.pone.0011587Benchmarking multi-rate codon models.Wayne DelportKonrad SchefflerMike B GravenorSpencer V MuseSergei Kosakovsky PondThe single rate codon model of non-synonymous substitution is ubiquitous in phylogenetic modeling. Indeed, the use of a non-synonymous to synonymous substitution rate ratio parameter has facilitated the interpretation of selection pressure on genomes. Although the single rate model has achieved wide acceptance, we argue that the assumption of a single rate of non-synonymous substitution is biologically unreasonable, given observed differences in substitution rates evident from empirical amino acid models. Some have attempted to incorporate amino acid substitution biases into models of codon evolution and have shown improved model performance versus the single rate model. Here, we show that the single rate model of non-synonymous substitution is easily outperformed by a model with multiple non-synonymous rate classes, yet in which amino acid substitution pairs are assigned randomly to these classes. We argue that, since the single rate model is so easy to improve upon, new codon models should not be validated entirely on the basis of improved model fit over this model. Rather, we should strive to both improve on the single rate model and to approximate the general time-reversible model of codon substitution, with as few parameters as possible, so as to reduce model over-fitting. We hint at how this can be achieved with a Genetic Algorithm approach in which rate classes are assigned on the basis of sequence information content.http://europepmc.org/articles/PMC2908124?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Wayne Delport Konrad Scheffler Mike B Gravenor Spencer V Muse Sergei Kosakovsky Pond
spellingShingle	Wayne Delport Konrad Scheffler Mike B Gravenor Spencer V Muse Sergei Kosakovsky Pond Benchmarking multi-rate codon models. PLoS ONE
author_facet	Wayne Delport Konrad Scheffler Mike B Gravenor Spencer V Muse Sergei Kosakovsky Pond
author_sort	Wayne Delport
title	Benchmarking multi-rate codon models.
title_short	Benchmarking multi-rate codon models.
title_full	Benchmarking multi-rate codon models.
title_fullStr	Benchmarking multi-rate codon models.
title_full_unstemmed	Benchmarking multi-rate codon models.
title_sort	benchmarking multi-rate codon models.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2010-07-01
description	The single rate codon model of non-synonymous substitution is ubiquitous in phylogenetic modeling. Indeed, the use of a non-synonymous to synonymous substitution rate ratio parameter has facilitated the interpretation of selection pressure on genomes. Although the single rate model has achieved wide acceptance, we argue that the assumption of a single rate of non-synonymous substitution is biologically unreasonable, given observed differences in substitution rates evident from empirical amino acid models. Some have attempted to incorporate amino acid substitution biases into models of codon evolution and have shown improved model performance versus the single rate model. Here, we show that the single rate model of non-synonymous substitution is easily outperformed by a model with multiple non-synonymous rate classes, yet in which amino acid substitution pairs are assigned randomly to these classes. We argue that, since the single rate model is so easy to improve upon, new codon models should not be validated entirely on the basis of improved model fit over this model. Rather, we should strive to both improve on the single rate model and to approximate the general time-reversible model of codon substitution, with as few parameters as possible, so as to reduce model over-fitting. We hint at how this can be achieved with a Genetic Algorithm approach in which rate classes are assigned on the basis of sequence information content.
url	http://europepmc.org/articles/PMC2908124?pdf=render
work_keys_str_mv	AT waynedelport benchmarkingmultiratecodonmodels AT konradscheffler benchmarkingmultiratecodonmodels AT mikebgravenor benchmarkingmultiratecodonmodels AT spencervmuse benchmarkingmultiratecodonmodels AT sergeikosakovskypond benchmarkingmultiratecodonmodels
_version_	1724982709115158528

Benchmarking multi-rate codon models.

Similar Items