Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies

Coronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, th...

Full description

Bibliographic Details
Main Authors: Shuai Zhang, Renliang Qu, Pengyan Wang, Shenghan Wang
Format: Article
Language:English
Published: Hindawi Limited 2021-01-01
Series:Computational and Mathematical Methods in Medicine
Online Access:http://dx.doi.org/10.1155/2021/2203636
id doaj-0a93429bf20f424988488e5ce73a3d6a
record_format Article
spelling doaj-0a93429bf20f424988488e5ce73a3d6a2021-10-11T00:39:39ZengHindawi LimitedComputational and Mathematical Methods in Medicine1748-67182021-01-01202110.1155/2021/2203636Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection StrategiesShuai Zhang0Renliang Qu1Pengyan Wang2Shenghan Wang3Department of Clinical LaboratoryDepartment of Clinical LaboratoryDepartment of Clinical LaboratoryDepartment of Microbiological LaboratoryCoronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, this detection method often leads to false negatives, thus triggering missed COVID-19 diagnosis. Therefore, it is urgent to find new biomarkers to increase the accuracy of COVID-19 diagnosis. To explore new biomarkers of COVID-19 in this study, expression profiles were firstly accessed from the GEO database. On this basis, 500 feature genes were screened by the minimum-redundancy maximum-relevancy (mRMR) feature selection method. Afterwards, the incremental feature selection (IFS) method was used to choose a classifier with the best performance from different feature gene-based support vector machine (SVM) classifiers. The corresponding 66 feature genes were set as the optimal feature genes. Lastly, the optimal feature genes were subjected to GO functional enrichment analysis, principal component analysis (PCA), and protein-protein interaction (PPI) network analysis. All in all, it was posited that the 66 feature genes could effectively classify positive and negative COVID-19 and work as new biomarkers of the disease.http://dx.doi.org/10.1155/2021/2203636
collection DOAJ
language English
format Article
sources DOAJ
author Shuai Zhang
Renliang Qu
Pengyan Wang
Shenghan Wang
spellingShingle Shuai Zhang
Renliang Qu
Pengyan Wang
Shenghan Wang
Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
Computational and Mathematical Methods in Medicine
author_facet Shuai Zhang
Renliang Qu
Pengyan Wang
Shenghan Wang
author_sort Shuai Zhang
title Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_short Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_full Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_fullStr Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_full_unstemmed Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
title_sort identification of novel covid-19 biomarkers by multiple feature selection strategies
publisher Hindawi Limited
series Computational and Mathematical Methods in Medicine
issn 1748-6718
publishDate 2021-01-01
description Coronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, this detection method often leads to false negatives, thus triggering missed COVID-19 diagnosis. Therefore, it is urgent to find new biomarkers to increase the accuracy of COVID-19 diagnosis. To explore new biomarkers of COVID-19 in this study, expression profiles were firstly accessed from the GEO database. On this basis, 500 feature genes were screened by the minimum-redundancy maximum-relevancy (mRMR) feature selection method. Afterwards, the incremental feature selection (IFS) method was used to choose a classifier with the best performance from different feature gene-based support vector machine (SVM) classifiers. The corresponding 66 feature genes were set as the optimal feature genes. Lastly, the optimal feature genes were subjected to GO functional enrichment analysis, principal component analysis (PCA), and protein-protein interaction (PPI) network analysis. All in all, it was posited that the 66 feature genes could effectively classify positive and negative COVID-19 and work as new biomarkers of the disease.
url http://dx.doi.org/10.1155/2021/2203636
work_keys_str_mv AT shuaizhang identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies
AT renliangqu identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies
AT pengyanwang identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies
AT shenghanwang identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies
_version_ 1716829086488199168