Synonymous Dinucleotide Usage: A Codon-Aware Metric for Quantifying Dinucleotide Representation in Viruses

Distinct patterns of dinucleotide representation, such as CpG and UpA suppression, are characteristic of certain viral genomes. Recent research has uncovered vertebrate immune mechanisms that select against specific dinucleotides in targeted viruses. This evidence highlights the importance of system...

Full description

Bibliographic Details
Main Authors: Spyros Lytras, Joseph Hughes
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Viruses
Subjects:
Online Access:https://www.mdpi.com/1999-4915/12/4/462
id doaj-5b9a338d30084730a762715a3dc239ea
record_format Article
spelling doaj-5b9a338d30084730a762715a3dc239ea2020-11-25T01:43:18ZengMDPI AGViruses1999-49152020-04-011246246210.3390/v12040462Synonymous Dinucleotide Usage: A Codon-Aware Metric for Quantifying Dinucleotide Representation in VirusesSpyros Lytras0Joseph Hughes1MRC—University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UKMRC—University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UKDistinct patterns of dinucleotide representation, such as CpG and UpA suppression, are characteristic of certain viral genomes. Recent research has uncovered vertebrate immune mechanisms that select against specific dinucleotides in targeted viruses. This evidence highlights the importance of systematically examining the dinucleotide composition of viral genomes. We have developed a novel metric, called synonymous dinucleotide usage (SDU), for quantifying dinucleotide representation in coding sequences. Our method compares the abundance of a given dinucleotide to the null hypothesis of equal synonymous codon usage in the sequence. We present a Python3 package, <i>DinuQ</i>, for calculating SDU and other relevant metrics. We have applied this method on two sets of invertebrate- and vertebrate-specific flaviviruses and rhabdoviruses. The SDU shows that the vertebrate viruses exhibit consistently greater under-representation of CpG dinucleotides in all three codon positions in both datasets. In comparison to existing metrics for dinucleotide quantification, the SDU allows for a statistical interpretation of its values by comparing it to a null expectation based on the codon table. Here we apply the method to viruses, but coding sequences of other living organisms can be analysed in the same way.https://www.mdpi.com/1999-4915/12/4/462dinucleotidesCpG suppression<i>Flaviviridae</i><i>Rhabdoviridae</i>synonymous codon usagebioinformatics
collection DOAJ
language English
format Article
sources DOAJ
author Spyros Lytras
Joseph Hughes
spellingShingle Spyros Lytras
Joseph Hughes
Synonymous Dinucleotide Usage: A Codon-Aware Metric for Quantifying Dinucleotide Representation in Viruses
Viruses
dinucleotides
CpG suppression
<i>Flaviviridae</i>
<i>Rhabdoviridae</i>
synonymous codon usage
bioinformatics
author_facet Spyros Lytras
Joseph Hughes
author_sort Spyros Lytras
title Synonymous Dinucleotide Usage: A Codon-Aware Metric for Quantifying Dinucleotide Representation in Viruses
title_short Synonymous Dinucleotide Usage: A Codon-Aware Metric for Quantifying Dinucleotide Representation in Viruses
title_full Synonymous Dinucleotide Usage: A Codon-Aware Metric for Quantifying Dinucleotide Representation in Viruses
title_fullStr Synonymous Dinucleotide Usage: A Codon-Aware Metric for Quantifying Dinucleotide Representation in Viruses
title_full_unstemmed Synonymous Dinucleotide Usage: A Codon-Aware Metric for Quantifying Dinucleotide Representation in Viruses
title_sort synonymous dinucleotide usage: a codon-aware metric for quantifying dinucleotide representation in viruses
publisher MDPI AG
series Viruses
issn 1999-4915
publishDate 2020-04-01
description Distinct patterns of dinucleotide representation, such as CpG and UpA suppression, are characteristic of certain viral genomes. Recent research has uncovered vertebrate immune mechanisms that select against specific dinucleotides in targeted viruses. This evidence highlights the importance of systematically examining the dinucleotide composition of viral genomes. We have developed a novel metric, called synonymous dinucleotide usage (SDU), for quantifying dinucleotide representation in coding sequences. Our method compares the abundance of a given dinucleotide to the null hypothesis of equal synonymous codon usage in the sequence. We present a Python3 package, <i>DinuQ</i>, for calculating SDU and other relevant metrics. We have applied this method on two sets of invertebrate- and vertebrate-specific flaviviruses and rhabdoviruses. The SDU shows that the vertebrate viruses exhibit consistently greater under-representation of CpG dinucleotides in all three codon positions in both datasets. In comparison to existing metrics for dinucleotide quantification, the SDU allows for a statistical interpretation of its values by comparing it to a null expectation based on the codon table. Here we apply the method to viruses, but coding sequences of other living organisms can be analysed in the same way.
topic dinucleotides
CpG suppression
<i>Flaviviridae</i>
<i>Rhabdoviridae</i>
synonymous codon usage
bioinformatics
url https://www.mdpi.com/1999-4915/12/4/462
work_keys_str_mv AT spyroslytras synonymousdinucleotideusageacodonawaremetricforquantifyingdinucleotiderepresentationinviruses
AT josephhughes synonymousdinucleotideusageacodonawaremetricforquantifyingdinucleotiderepresentationinviruses
_version_ 1725032190221221888