A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction

Abstract Whereas accelerated attention beclouded early stages of the coronavirus spread, knowledge of actual pathogenicity and origin of possible sub-strains remained unclear. By harvesting the Global initiative on Sharing All Influenza Data (GISAID) database ( https://www.gisaid.org/ ), between Dec...

Full description

Bibliographic Details
Main Authors: Moses Effiong Ekpenyong, Mercy Ernest Edoho, Udoinyang Godwin Inyang, Faith-Michael Uzoka, Itemobong Samuel Ekaidem, Anietie Effiong Moses, Martins Ochubiojo Emeje, Youtchou Mirabeau Tatfeng, Ifiok James Udo, EnoAbasi Deborah Anwana, Oboso Edem Etim, Joseph Ikim Geoffery, Emmanuel Ambrose Dan
Format: Article
Language:English
Published: Nature Publishing Group 2021-07-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-93757-w
id doaj-2dbfd35eab264c17aa310891ba1970e6
record_format Article
spelling doaj-2dbfd35eab264c17aa310891ba1970e62021-07-18T11:26:40ZengNature Publishing GroupScientific Reports2045-23222021-07-0111112510.1038/s41598-021-93757-wA hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and predictionMoses Effiong Ekpenyong0Mercy Ernest Edoho1Udoinyang Godwin Inyang2Faith-Michael Uzoka3Itemobong Samuel Ekaidem4Anietie Effiong Moses5Martins Ochubiojo Emeje6Youtchou Mirabeau Tatfeng7Ifiok James Udo8EnoAbasi Deborah Anwana9Oboso Edem Etim10Joseph Ikim Geoffery11Emmanuel Ambrose Dan12Department of Computer Science, University of UyoDepartment of Computer Science, University of UyoDepartment of Computer Science, University of UyoDepartment of Mathematics and Computing, Mount Royal UniversityCollege of Health Sciences, University of UyoCollege of Health Sciences, University of UyoNational Institute for Pharmaceutical Research and Development (NIPRD)College of Health Sciences, Niger Delta UniversityDepartment of Computer Science, University of UyoDepartment of Botany and Ecological Studies, University of UyoDepartment of Biochemistry, University of UyoDepartment of Computer Science, University of UyoDepartment of Computer Science, University of UyoAbstract Whereas accelerated attention beclouded early stages of the coronavirus spread, knowledge of actual pathogenicity and origin of possible sub-strains remained unclear. By harvesting the Global initiative on Sharing All Influenza Data (GISAID) database ( https://www.gisaid.org/ ), between December 2019 and January 15, 2021, a total of 8864 human SARS-CoV-2 complete genome sequences processed by gender, across 6 continents (88 countries) of the world, Antarctica exempt, were analyzed. We hypothesized that data speak for itself and can discern true and explainable patterns of the disease. Identical genome diversity and pattern correlates analysis performed using a hybrid of biotechnology and machine learning methods corroborate the emergence of inter- and intra- SARS-CoV-2 sub-strains transmission and sustain an increase in sub-strains within the various continents, with nucleotide mutations dynamically varying between individuals in close association with the virus as it adapts to its host/environment. Interestingly, some viral sub-strain patterns progressively transformed into new sub-strain clusters indicating varying amino acid, and strong nucleotide association derived from same lineage. A novel cognitive approach to knowledge mining helped the discovery of transmission routes and seamless contact tracing protocol. Our classification results were better than state-of-the-art methods, indicating a more robust system for predicting emerging or new viral sub-strain(s). The results therefore offer explanations for the growing concerns about the virus and its next wave(s). A future direction of this work is a defuzzification of confusable pattern clusters for precise intra-country SARS-CoV-2 sub-strains analytics.https://doi.org/10.1038/s41598-021-93757-w
collection DOAJ
language English
format Article
sources DOAJ
author Moses Effiong Ekpenyong
Mercy Ernest Edoho
Udoinyang Godwin Inyang
Faith-Michael Uzoka
Itemobong Samuel Ekaidem
Anietie Effiong Moses
Martins Ochubiojo Emeje
Youtchou Mirabeau Tatfeng
Ifiok James Udo
EnoAbasi Deborah Anwana
Oboso Edem Etim
Joseph Ikim Geoffery
Emmanuel Ambrose Dan
spellingShingle Moses Effiong Ekpenyong
Mercy Ernest Edoho
Udoinyang Godwin Inyang
Faith-Michael Uzoka
Itemobong Samuel Ekaidem
Anietie Effiong Moses
Martins Ochubiojo Emeje
Youtchou Mirabeau Tatfeng
Ifiok James Udo
EnoAbasi Deborah Anwana
Oboso Edem Etim
Joseph Ikim Geoffery
Emmanuel Ambrose Dan
A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction
Scientific Reports
author_facet Moses Effiong Ekpenyong
Mercy Ernest Edoho
Udoinyang Godwin Inyang
Faith-Michael Uzoka
Itemobong Samuel Ekaidem
Anietie Effiong Moses
Martins Ochubiojo Emeje
Youtchou Mirabeau Tatfeng
Ifiok James Udo
EnoAbasi Deborah Anwana
Oboso Edem Etim
Joseph Ikim Geoffery
Emmanuel Ambrose Dan
author_sort Moses Effiong Ekpenyong
title A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction
title_short A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction
title_full A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction
title_fullStr A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction
title_full_unstemmed A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction
title_sort hybrid computational framework for intelligent inter-continent sars-cov-2 sub-strains characterization and prediction
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-07-01
description Abstract Whereas accelerated attention beclouded early stages of the coronavirus spread, knowledge of actual pathogenicity and origin of possible sub-strains remained unclear. By harvesting the Global initiative on Sharing All Influenza Data (GISAID) database ( https://www.gisaid.org/ ), between December 2019 and January 15, 2021, a total of 8864 human SARS-CoV-2 complete genome sequences processed by gender, across 6 continents (88 countries) of the world, Antarctica exempt, were analyzed. We hypothesized that data speak for itself and can discern true and explainable patterns of the disease. Identical genome diversity and pattern correlates analysis performed using a hybrid of biotechnology and machine learning methods corroborate the emergence of inter- and intra- SARS-CoV-2 sub-strains transmission and sustain an increase in sub-strains within the various continents, with nucleotide mutations dynamically varying between individuals in close association with the virus as it adapts to its host/environment. Interestingly, some viral sub-strain patterns progressively transformed into new sub-strain clusters indicating varying amino acid, and strong nucleotide association derived from same lineage. A novel cognitive approach to knowledge mining helped the discovery of transmission routes and seamless contact tracing protocol. Our classification results were better than state-of-the-art methods, indicating a more robust system for predicting emerging or new viral sub-strain(s). The results therefore offer explanations for the growing concerns about the virus and its next wave(s). A future direction of this work is a defuzzification of confusable pattern clusters for precise intra-country SARS-CoV-2 sub-strains analytics.
url https://doi.org/10.1038/s41598-021-93757-w
work_keys_str_mv AT moseseffiongekpenyong ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT mercyernestedoho ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT udoinyanggodwininyang ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT faithmichaeluzoka ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT itemobongsamuelekaidem ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT anietieeffiongmoses ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT martinsochubiojoemeje ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT youtchoumirabeautatfeng ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT ifiokjamesudo ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT enoabasideborahanwana ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT obosoedemetim ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT josephikimgeoffery ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT emmanuelambrosedan ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT moseseffiongekpenyong hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT mercyernestedoho hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT udoinyanggodwininyang hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT faithmichaeluzoka hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT itemobongsamuelekaidem hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT anietieeffiongmoses hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT martinsochubiojoemeje hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT youtchoumirabeautatfeng hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT ifiokjamesudo hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT enoabasideborahanwana hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT obosoedemetim hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT josephikimgeoffery hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
AT emmanuelambrosedan hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction
_version_ 1721296096483868672