Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.

It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K...

Bibliographic Details
Main Authors: Frank Mannino, Sadie Wisotsky, Sergei L Kosakovsky Pond, Spencer V Muse
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2020-01-01
Series: PLoS ONE
Online Access: https://doi.org/10.1371/journal.pone.0229493
Citation: PLoS ONE, volume 15, issue 3, e0229493 (2020-01-01)
ISSN: 1932-6203
Description: It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered.
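
The variance ceiling mentioned in the description can be illustrated with a short sketch. This record does not state the bound's formula. The form used here, (K - 1) * mean^2, is an elementary consequence of having K non-negative rates with equal weights, and it is attained when one category carries all of the rate mass. The gamma distribution with shape 0.1, the sampling-based discretization (sort draws, average within K equal-sized quantile bins), and all function names are assumptions chosen for illustration, not the authors' procedure.

```python
import random
import statistics


def equiprobable_rates(samples, k):
    """Approximate a K-category equiprobable discretization.

    Sorts the draws and averages them within K equal-sized quantile bins.
    Assumes len(samples) is divisible by k so every bin has equal weight.
    """
    if len(samples) % k != 0:
        raise ValueError("number of samples must be divisible by k")
    ordered = sorted(samples)
    size = len(ordered) // k
    return [statistics.fmean(ordered[i * size:(i + 1) * size]) for i in range(k)]


def discretized_variance(rates):
    """Variance of a discrete distribution with equal weight on each rate."""
    return statistics.pvariance(rates)


def variance_bound(k, mean):
    """Largest variance K equally weighted non-negative rates can have."""
    return (k - 1) * mean ** 2


# Hypothetical setting: gamma-distributed rates with mean 1 and shape 0.1,
# so the continuous distribution has variance 1 / 0.1 = 10.
ALPHA = 0.1
TRUE_VARIANCE = 1.0 / ALPHA

rng = random.Random(0)
samples = [rng.gammavariate(ALPHA, 1.0 / ALPHA) for _ in range(100_000)]

results = {}
for k in (2, 4, 8):
    rates = equiprobable_rates(samples, k)
    mean = statistics.fmean(rates)
    results[k] = (discretized_variance(rates), variance_bound(k, mean))
    print(f"K={k}: discretized variance={results[k][0]:.3f}, "
          f"bound={results[k][1]:.3f}, continuous variance={TRUE_VARIANCE:.1f}")
```

In this setting the continuous variance (10) exceeds the ceiling for each K shown, so no choice of K equiprobable categories up to 8 can represent it. That is the kind of underestimation the title refers to.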