Emergence of novel SARS-CoV-2 variants in the Netherlands
Abstract Coronavirus disease 2019 (COVID-19) has emerged in December 2019 when the first case was reported in Wuhan, China and turned into a pandemic with 27 million (September 9th) cases. Currently, there are over 95,000 complete genome sequences of the severe acute respiratory syndrome coronavirus...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2021-03-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-021-85363-7 |
id |
doaj-9c1c0c7c65b2453bac5a35c169170efe |
---|---|
record_format |
Article |
spelling |
doaj-9c1c0c7c65b2453bac5a35c169170efe2021-03-28T11:33:22ZengNature Publishing GroupScientific Reports2045-23222021-03-0111111510.1038/s41598-021-85363-7Emergence of novel SARS-CoV-2 variants in the NetherlandsAysun Urhan0Thomas Abeel1Delft Bioinformatics Lab, Delft University of Technology Van MourikDelft Bioinformatics Lab, Delft University of Technology Van MourikAbstract Coronavirus disease 2019 (COVID-19) has emerged in December 2019 when the first case was reported in Wuhan, China and turned into a pandemic with 27 million (September 9th) cases. Currently, there are over 95,000 complete genome sequences of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus causing COVID-19, in public databases, accompanying a growing number of studies. Nevertheless, there is still much to learn about the viral population variation when the virus is evolving as it continues to spread. We have analyzed SARS-CoV-2 genomes to identify the most variant sites, as well as the stable, conserved ones in samples collected in the Netherlands until June 2020. We identified the most frequent mutations in different geographies. We also performed a phylogenetic study focused on the Netherlands to detect novel variants emerging in the late stages of the pandemic and forming local clusters. We investigated the S and N proteins on SARS-CoV-2 genomes in the Netherlands and found the most variant and stable sites to guide development of diagnostics assays and vaccines. We observed that while the SARS-CoV-2 genome has accumulated mutations, diverging from reference sequence, the variation landscape is dominated by four mutations globally, suggesting the current reference does not represent the virus samples circulating currently. In addition, we detected novel variants of SARS-CoV-2 almost unique to the Netherlands that form localized clusters and region-specific sub-populations indicating community spread. We explored SARS-CoV-2 variants in the Netherlands until June 2020 within a global context; our results provide insight into the viral population diversity for localized efforts in tracking the transmission of COVID-19, as well as sequenced-based approaches in diagnostics and therapeutics. We emphasize that little diversity is observed globally in recent samples despite the increased number of mutations relative to the established reference sequence. We suggest sequence-based analyses should opt for a consensus representation to adequately cover the genomic variation observed to speed up diagnostics and vaccine design.https://doi.org/10.1038/s41598-021-85363-7 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Aysun Urhan Thomas Abeel |
spellingShingle |
Aysun Urhan Thomas Abeel Emergence of novel SARS-CoV-2 variants in the Netherlands Scientific Reports |
author_facet |
Aysun Urhan Thomas Abeel |
author_sort |
Aysun Urhan |
title |
Emergence of novel SARS-CoV-2 variants in the Netherlands |
title_short |
Emergence of novel SARS-CoV-2 variants in the Netherlands |
title_full |
Emergence of novel SARS-CoV-2 variants in the Netherlands |
title_fullStr |
Emergence of novel SARS-CoV-2 variants in the Netherlands |
title_full_unstemmed |
Emergence of novel SARS-CoV-2 variants in the Netherlands |
title_sort |
emergence of novel sars-cov-2 variants in the netherlands |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2021-03-01 |
description |
Abstract Coronavirus disease 2019 (COVID-19) has emerged in December 2019 when the first case was reported in Wuhan, China and turned into a pandemic with 27 million (September 9th) cases. Currently, there are over 95,000 complete genome sequences of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus causing COVID-19, in public databases, accompanying a growing number of studies. Nevertheless, there is still much to learn about the viral population variation when the virus is evolving as it continues to spread. We have analyzed SARS-CoV-2 genomes to identify the most variant sites, as well as the stable, conserved ones in samples collected in the Netherlands until June 2020. We identified the most frequent mutations in different geographies. We also performed a phylogenetic study focused on the Netherlands to detect novel variants emerging in the late stages of the pandemic and forming local clusters. We investigated the S and N proteins on SARS-CoV-2 genomes in the Netherlands and found the most variant and stable sites to guide development of diagnostics assays and vaccines. We observed that while the SARS-CoV-2 genome has accumulated mutations, diverging from reference sequence, the variation landscape is dominated by four mutations globally, suggesting the current reference does not represent the virus samples circulating currently. In addition, we detected novel variants of SARS-CoV-2 almost unique to the Netherlands that form localized clusters and region-specific sub-populations indicating community spread. We explored SARS-CoV-2 variants in the Netherlands until June 2020 within a global context; our results provide insight into the viral population diversity for localized efforts in tracking the transmission of COVID-19, as well as sequenced-based approaches in diagnostics and therapeutics. We emphasize that little diversity is observed globally in recent samples despite the increased number of mutations relative to the established reference sequence. We suggest sequence-based analyses should opt for a consensus representation to adequately cover the genomic variation observed to speed up diagnostics and vaccine design. |
url |
https://doi.org/10.1038/s41598-021-85363-7 |
work_keys_str_mv |
AT aysunurhan emergenceofnovelsarscov2variantsinthenetherlands AT thomasabeel emergenceofnovelsarscov2variantsinthenetherlands |
_version_ |
1724199879604961280 |