Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.
It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2020-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0229493 |
id |
doaj-c919fc1c881f447fa098d308f7d75ebd |
---|---|
record_format |
Article |
spelling |
doaj-c919fc1c881f447fa098d308f7d75ebd2021-03-04T12:28:54ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01153e022949310.1371/journal.pone.0229493Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.Frank ManninoSadie WisotskySergei L Kosakovsky PondSpencer V MuseIt is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered.https://doi.org/10.1371/journal.pone.0229493 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Frank Mannino Sadie Wisotsky Sergei L Kosakovsky Pond Spencer V Muse |
spellingShingle |
Frank Mannino Sadie Wisotsky Sergei L Kosakovsky Pond Spencer V Muse Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability. PLoS ONE |
author_facet |
Frank Mannino Sadie Wisotsky Sergei L Kosakovsky Pond Spencer V Muse |
author_sort |
Frank Mannino |
title |
Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability. |
title_short |
Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability. |
title_full |
Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability. |
title_fullStr |
Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability. |
title_full_unstemmed |
Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability. |
title_sort |
equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2020-01-01 |
description |
It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered. |
url |
https://doi.org/10.1371/journal.pone.0229493 |
work_keys_str_mv |
AT frankmannino equiprobablediscretemodelsofsitespecificsubstitutionratesunderestimatetheextentofratevariability AT sadiewisotsky equiprobablediscretemodelsofsitespecificsubstitutionratesunderestimatetheextentofratevariability AT sergeilkosakovskypond equiprobablediscretemodelsofsitespecificsubstitutionratesunderestimatetheextentofratevariability AT spencervmuse equiprobablediscretemodelsofsitespecificsubstitutionratesunderestimatetheextentofratevariability |
_version_ |
1714802544591503360 |