Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling

<p>Abstract</p> <p>Background</p> <p>The exact origin of the cause of the Severe Acute Respiratory Syndrome (SARS) is still an open question. The genomic sequence relationship of SARS-CoV with 30 different single-stranded RNA (<it>ss</it>RNA) viruses of vari...

Full description

Bibliographic Details
Main Authors: Danchin Antoine, Zhang Xue, Yap Yee
Format: Article
Language:English
Published: BMC 2003-09-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://www.biomedcentral.com/1471-2105/4/43
id doaj-5b4060c10ea746d1a2d1b3b261f10794
record_format Article
spelling doaj-5b4060c10ea746d1a2d1b3b261f107942020-11-25T00:38:52ZengBMCBMC Bioinformatics1471-21052003-09-01414310.1186/1471-2105-4-43Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profilingDanchin AntoineZhang XueYap Yee<p>Abstract</p> <p>Background</p> <p>The exact origin of the cause of the Severe Acute Respiratory Syndrome (SARS) is still an open question. The genomic sequence relationship of SARS-CoV with 30 different single-stranded RNA (<it>ss</it>RNA) viruses of various families was studied using two non-standard approaches. Both approaches began with the vectorial profiling of the tetra-nucleotide usage pattern <it>V </it>for each virus. In approach one, a distance measure of a vector <it>V</it>, based on correlation coefficient was devised to construct a relationship tree by the neighbor-joining algorithm. In approach two, a multivariate factor analysis was performed to derive the embedded tetra-nucleotide usage patterns. These patterns were subsequently used to classify the selected viruses.</p> <p>Results</p> <p>Both approaches yielded relationship outcomes that are consistent with the known virus classification. They also indicated that the genome of RNA viruses from the same family conform to a specific pattern of word usage. Based on the correlation of the overall tetra-nucleotide usage patterns, the Transmissible Gastroenteritis Virus (TGV) and the Feline CoronaVirus (FCoV) are closest to SARS-CoV. Surprisingly also, the RNA viruses that do not go through a DNA stage displayed a remarkable discrimination against the CpG and UpA di-nucleotide (<it>z </it>= -77.31, -52.48 respectively) and selection for UpG and CpA (<it>z </it>= 65.79,49.99 respectively). Potential factors influencing these biases are discussed.</p> <p>Conclusion</p> <p>The study of genomic word usage is a powerful method to classify RNA viruses. The congruence of the relationship outcomes with the known classification indicates that there exist phylogenetic signals in the tetra-nucleotide usage patterns, that is most prominent in the replicase open reading frames.</p> http://www.biomedcentral.com/1471-2105/4/43RNA virusconvergent evolutionhorizontal gene transferfactor analysisSARS
collection DOAJ
language English
format Article
sources DOAJ
author Danchin Antoine
Zhang Xue
Yap Yee
spellingShingle Danchin Antoine
Zhang Xue
Yap Yee
Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling
BMC Bioinformatics
RNA virus
convergent evolution
horizontal gene transfer
factor analysis
SARS
author_facet Danchin Antoine
Zhang Xue
Yap Yee
author_sort Danchin Antoine
title Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling
title_short Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling
title_full Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling
title_fullStr Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling
title_full_unstemmed Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling
title_sort relationship of sars-cov to other pathogenic rna viruses explored by tetranucleotide usage profiling
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2003-09-01
description <p>Abstract</p> <p>Background</p> <p>The exact origin of the cause of the Severe Acute Respiratory Syndrome (SARS) is still an open question. The genomic sequence relationship of SARS-CoV with 30 different single-stranded RNA (<it>ss</it>RNA) viruses of various families was studied using two non-standard approaches. Both approaches began with the vectorial profiling of the tetra-nucleotide usage pattern <it>V </it>for each virus. In approach one, a distance measure of a vector <it>V</it>, based on correlation coefficient was devised to construct a relationship tree by the neighbor-joining algorithm. In approach two, a multivariate factor analysis was performed to derive the embedded tetra-nucleotide usage patterns. These patterns were subsequently used to classify the selected viruses.</p> <p>Results</p> <p>Both approaches yielded relationship outcomes that are consistent with the known virus classification. They also indicated that the genome of RNA viruses from the same family conform to a specific pattern of word usage. Based on the correlation of the overall tetra-nucleotide usage patterns, the Transmissible Gastroenteritis Virus (TGV) and the Feline CoronaVirus (FCoV) are closest to SARS-CoV. Surprisingly also, the RNA viruses that do not go through a DNA stage displayed a remarkable discrimination against the CpG and UpA di-nucleotide (<it>z </it>= -77.31, -52.48 respectively) and selection for UpG and CpA (<it>z </it>= 65.79,49.99 respectively). Potential factors influencing these biases are discussed.</p> <p>Conclusion</p> <p>The study of genomic word usage is a powerful method to classify RNA viruses. The congruence of the relationship outcomes with the known classification indicates that there exist phylogenetic signals in the tetra-nucleotide usage patterns, that is most prominent in the replicase open reading frames.</p>
topic RNA virus
convergent evolution
horizontal gene transfer
factor analysis
SARS
url http://www.biomedcentral.com/1471-2105/4/43
work_keys_str_mv AT danchinantoine relationshipofsarscovtootherpathogenicrnavirusesexploredbytetranucleotideusageprofiling
AT zhangxue relationshipofsarscovtootherpathogenicrnavirusesexploredbytetranucleotideusageprofiling
AT yapyee relationshipofsarscovtootherpathogenicrnavirusesexploredbytetranucleotideusageprofiling
_version_ 1725296148021772288