Phylogenies from dynamic networks.

The relationship between the underlying contact network over which a pathogen spreads and the pathogen phylogenetic trees that are obtained presents an opportunity to use sequence data to learn about contact networks that are difficult to study empirically. However, this relationship is not explicit...

Full description

Bibliographic Details
Main Authors: Cornelia Metzig, Oliver Ratmann, Daniela Bezemer, Caroline Colijn
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-02-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1006761
id doaj-95acad202bad41d188f24c5447d16243
record_format Article
spelling doaj-95acad202bad41d188f24c5447d162432021-04-21T15:11:53ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582019-02-01152e100676110.1371/journal.pcbi.1006761Phylogenies from dynamic networks.Cornelia MetzigOliver RatmannDaniela BezemerCaroline ColijnThe relationship between the underlying contact network over which a pathogen spreads and the pathogen phylogenetic trees that are obtained presents an opportunity to use sequence data to learn about contact networks that are difficult to study empirically. However, this relationship is not explicitly known and is usually studied in simulations, often with the simplifying assumption that the contact network is static in time, though human contact networks are dynamic. We simulate pathogen phylogenetic trees on dynamic Erdős-Renyi random networks and on two dynamic networks with skewed degree distribution, of which one is additionally clustered. We use tree shape features to explore how adding dynamics changes the relationships between the overall network structure and phylogenies. Our tree features include the number of small substructures (cherries, pitchforks) in the trees, measures of tree imbalance (Sackin index, Colless index), features derived from network science (diameter, closeness), as well as features using the internal branch lengths from the tip to the root. Using principal component analysis we find that the network dynamics influence the shapes of phylogenies, as does the network type. We also compare dynamic and time-integrated static networks. We find, in particular, that static network models like the widely used Barabasi-Albert model can be poor approximations for dynamic networks. We explore the effects of mis-specifying the network on the performance of classifiers trained identify the transmission rate (using supervised learning methods). We find that both mis-specification of the underlying network and its parameters (mean degree, turnover rate) have a strong adverse effect on the ability to estimate the transmission parameter. We illustrate these results by classifying HIV trees with a classifier that we trained on simulated trees from different networks, infection rates and turnover rates. Our results point to the importance of correctly estimating and modelling contact networks with dynamics when using phylodynamic tools to estimate epidemiological parameters.https://doi.org/10.1371/journal.pcbi.1006761
collection DOAJ
language English
format Article
sources DOAJ
author Cornelia Metzig
Oliver Ratmann
Daniela Bezemer
Caroline Colijn
spellingShingle Cornelia Metzig
Oliver Ratmann
Daniela Bezemer
Caroline Colijn
Phylogenies from dynamic networks.
PLoS Computational Biology
author_facet Cornelia Metzig
Oliver Ratmann
Daniela Bezemer
Caroline Colijn
author_sort Cornelia Metzig
title Phylogenies from dynamic networks.
title_short Phylogenies from dynamic networks.
title_full Phylogenies from dynamic networks.
title_fullStr Phylogenies from dynamic networks.
title_full_unstemmed Phylogenies from dynamic networks.
title_sort phylogenies from dynamic networks.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2019-02-01
description The relationship between the underlying contact network over which a pathogen spreads and the pathogen phylogenetic trees that are obtained presents an opportunity to use sequence data to learn about contact networks that are difficult to study empirically. However, this relationship is not explicitly known and is usually studied in simulations, often with the simplifying assumption that the contact network is static in time, though human contact networks are dynamic. We simulate pathogen phylogenetic trees on dynamic Erdős-Renyi random networks and on two dynamic networks with skewed degree distribution, of which one is additionally clustered. We use tree shape features to explore how adding dynamics changes the relationships between the overall network structure and phylogenies. Our tree features include the number of small substructures (cherries, pitchforks) in the trees, measures of tree imbalance (Sackin index, Colless index), features derived from network science (diameter, closeness), as well as features using the internal branch lengths from the tip to the root. Using principal component analysis we find that the network dynamics influence the shapes of phylogenies, as does the network type. We also compare dynamic and time-integrated static networks. We find, in particular, that static network models like the widely used Barabasi-Albert model can be poor approximations for dynamic networks. We explore the effects of mis-specifying the network on the performance of classifiers trained identify the transmission rate (using supervised learning methods). We find that both mis-specification of the underlying network and its parameters (mean degree, turnover rate) have a strong adverse effect on the ability to estimate the transmission parameter. We illustrate these results by classifying HIV trees with a classifier that we trained on simulated trees from different networks, infection rates and turnover rates. Our results point to the importance of correctly estimating and modelling contact networks with dynamics when using phylodynamic tools to estimate epidemiological parameters.
url https://doi.org/10.1371/journal.pcbi.1006761
work_keys_str_mv AT corneliametzig phylogeniesfromdynamicnetworks
AT oliverratmann phylogeniesfromdynamicnetworks
AT danielabezemer phylogeniesfromdynamicnetworks
AT carolinecolijn phylogeniesfromdynamicnetworks
_version_ 1714667801914900480