Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19

Motivation: Recently, the outbreak of Coronavirus-Covid-19 has forced the World Health Organization to declare a pandemic status. A genome sequence is the core of this virus which interferes with the normal activities of its counterparts within humans. Analysis of its genome may provide clues toward...

Full description

Bibliographic Details
Main Authors: Mahmoud Naghibzadeh, Hossein Savari, Abdorreza Savadi, Nayyereh Saadati, Elahe Mehrazin
Format: Article
Language:English
Published: Elsevier 2020-01-01
Series:Informatics in Medicine Unlocked
Online Access:http://www.sciencedirect.com/science/article/pii/S2352914820302562
id doaj-9516929ae969445f9ee744bd97d3ad4a
record_format Article
spelling doaj-9516929ae969445f9ee744bd97d3ad4a2020-11-25T03:09:29ZengElsevierInformatics in Medicine Unlocked2352-91482020-01-0119100356Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19Mahmoud Naghibzadeh0Hossein Savari1Abdorreza Savadi2Nayyereh Saadati3Elahe Mehrazin4Knowledge Engineering Research Group, Computer Engineering Dept., Ferdowsi University of Mashhad, Mashhad, Iran; High Performance Computing Lab., Computer Engineering Dept., Ferdowsi University of Mashhad, Mashhad, Iran; Corresponding author. Knowledge Engineering Research Group, Computer Engineering Dept., Ferdowsi University of Mashhad, Mashhad, Iran.Knowledge Engineering Research Group, Computer Engineering Dept., Ferdowsi University of Mashhad, Mashhad, IranHigh Performance Computing Lab., Computer Engineering Dept., Ferdowsi University of Mashhad, Mashhad, IranGhaem Hospital, Department of Internal Medicine, Mashhad University of Medical Sciences, Mashhad, IranKnowledge Engineering Research Group, Computer Engineering Dept., Ferdowsi University of Mashhad, Mashhad, IranMotivation: Recently, the outbreak of Coronavirus-Covid-19 has forced the World Health Organization to declare a pandemic status. A genome sequence is the core of this virus which interferes with the normal activities of its counterparts within humans. Analysis of its genome may provide clues toward the proper treatment of patients and the design of new drugs and vaccines. Microsatellites are composed of short genome subsequences which are successively repeated many times in the same direction. They are highly variable in terms of their building blocks, number of repeats, and their locations in the genome sequences. This mutability property has been the source of many diseases. Usually the host genome is analyzed to diagnose possible diseases in the victim. In this research, the focus is concentrated on the attacker's genome for discovery of its malicious properties. Results: The focus of this research is the microsatellites of both SARS and Covid-19. An accurate and highly efficient computer method for identifying all microsatellites in the genome sequences is discovered and implemented, and it is used to find all microsatellites in the Coronavirus-Covid-19 and SARS2003. The Microsatellite discovery is based on an efficient indexing technique called K-Mer Hash Indexing. The method is called Fast Microsatellite Discovery (FMSD) and it is used for both SARS and Covid-19. A table composed of all microsatellites is reported. There are many differences between SARS and Covid-19, but there is an outstanding difference which requires further investigation. Availability: FMSD is freely available at https://gitlab.com/FUM_HPCLab/fmsd_project, implemented in C on Linux-Ubuntu system. Software related contact: hossein_savari@mail.um.ac.ir.http://www.sciencedirect.com/science/article/pii/S2352914820302562
collection DOAJ
language English
format Article
sources DOAJ
author Mahmoud Naghibzadeh
Hossein Savari
Abdorreza Savadi
Nayyereh Saadati
Elahe Mehrazin
spellingShingle Mahmoud Naghibzadeh
Hossein Savari
Abdorreza Savadi
Nayyereh Saadati
Elahe Mehrazin
Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19
Informatics in Medicine Unlocked
author_facet Mahmoud Naghibzadeh
Hossein Savari
Abdorreza Savadi
Nayyereh Saadati
Elahe Mehrazin
author_sort Mahmoud Naghibzadeh
title Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19
title_short Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19
title_full Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19
title_fullStr Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19
title_full_unstemmed Developing an ultra-efficient microsatellite discoverer to find structural differences between SARS-CoV-1 and Covid-19
title_sort developing an ultra-efficient microsatellite discoverer to find structural differences between sars-cov-1 and covid-19
publisher Elsevier
series Informatics in Medicine Unlocked
issn 2352-9148
publishDate 2020-01-01
description Motivation: Recently, the outbreak of Coronavirus-Covid-19 has forced the World Health Organization to declare a pandemic status. A genome sequence is the core of this virus which interferes with the normal activities of its counterparts within humans. Analysis of its genome may provide clues toward the proper treatment of patients and the design of new drugs and vaccines. Microsatellites are composed of short genome subsequences which are successively repeated many times in the same direction. They are highly variable in terms of their building blocks, number of repeats, and their locations in the genome sequences. This mutability property has been the source of many diseases. Usually the host genome is analyzed to diagnose possible diseases in the victim. In this research, the focus is concentrated on the attacker's genome for discovery of its malicious properties. Results: The focus of this research is the microsatellites of both SARS and Covid-19. An accurate and highly efficient computer method for identifying all microsatellites in the genome sequences is discovered and implemented, and it is used to find all microsatellites in the Coronavirus-Covid-19 and SARS2003. The Microsatellite discovery is based on an efficient indexing technique called K-Mer Hash Indexing. The method is called Fast Microsatellite Discovery (FMSD) and it is used for both SARS and Covid-19. A table composed of all microsatellites is reported. There are many differences between SARS and Covid-19, but there is an outstanding difference which requires further investigation. Availability: FMSD is freely available at https://gitlab.com/FUM_HPCLab/fmsd_project, implemented in C on Linux-Ubuntu system. Software related contact: hossein_savari@mail.um.ac.ir.
url http://www.sciencedirect.com/science/article/pii/S2352914820302562
work_keys_str_mv AT mahmoudnaghibzadeh developinganultraefficientmicrosatellitediscoverertofindstructuraldifferencesbetweensarscov1andcovid19
AT hosseinsavari developinganultraefficientmicrosatellitediscoverertofindstructuraldifferencesbetweensarscov1andcovid19
AT abdorrezasavadi developinganultraefficientmicrosatellitediscoverertofindstructuraldifferencesbetweensarscov1andcovid19
AT nayyerehsaadati developinganultraefficientmicrosatellitediscoverertofindstructuraldifferencesbetweensarscov1andcovid19
AT elahemehrazin developinganultraefficientmicrosatellitediscoverertofindstructuraldifferencesbetweensarscov1andcovid19
_version_ 1724662358227288064