Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies
Coronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, th...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2021-01-01
|
Series: | Computational and Mathematical Methods in Medicine |
Online Access: | http://dx.doi.org/10.1155/2021/2203636 |
id |
doaj-0a93429bf20f424988488e5ce73a3d6a |
---|---|
record_format |
Article |
spelling |
doaj-0a93429bf20f424988488e5ce73a3d6a2021-10-11T00:39:39ZengHindawi LimitedComputational and Mathematical Methods in Medicine1748-67182021-01-01202110.1155/2021/2203636Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection StrategiesShuai Zhang0Renliang Qu1Pengyan Wang2Shenghan Wang3Department of Clinical LaboratoryDepartment of Clinical LaboratoryDepartment of Clinical LaboratoryDepartment of Microbiological LaboratoryCoronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, this detection method often leads to false negatives, thus triggering missed COVID-19 diagnosis. Therefore, it is urgent to find new biomarkers to increase the accuracy of COVID-19 diagnosis. To explore new biomarkers of COVID-19 in this study, expression profiles were firstly accessed from the GEO database. On this basis, 500 feature genes were screened by the minimum-redundancy maximum-relevancy (mRMR) feature selection method. Afterwards, the incremental feature selection (IFS) method was used to choose a classifier with the best performance from different feature gene-based support vector machine (SVM) classifiers. The corresponding 66 feature genes were set as the optimal feature genes. Lastly, the optimal feature genes were subjected to GO functional enrichment analysis, principal component analysis (PCA), and protein-protein interaction (PPI) network analysis. All in all, it was posited that the 66 feature genes could effectively classify positive and negative COVID-19 and work as new biomarkers of the disease.http://dx.doi.org/10.1155/2021/2203636 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Shuai Zhang Renliang Qu Pengyan Wang Shenghan Wang |
spellingShingle |
Shuai Zhang Renliang Qu Pengyan Wang Shenghan Wang Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies Computational and Mathematical Methods in Medicine |
author_facet |
Shuai Zhang Renliang Qu Pengyan Wang Shenghan Wang |
author_sort |
Shuai Zhang |
title |
Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies |
title_short |
Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies |
title_full |
Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies |
title_fullStr |
Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies |
title_full_unstemmed |
Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies |
title_sort |
identification of novel covid-19 biomarkers by multiple feature selection strategies |
publisher |
Hindawi Limited |
series |
Computational and Mathematical Methods in Medicine |
issn |
1748-6718 |
publishDate |
2021-01-01 |
description |
Coronavirus disease 2019 (COVID-19) arising from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic since its first report in December 2019. So far, SARS-CoV-2 nucleic acid detection has been deemed as the golden standard of COVID-19 diagnosis. However, this detection method often leads to false negatives, thus triggering missed COVID-19 diagnosis. Therefore, it is urgent to find new biomarkers to increase the accuracy of COVID-19 diagnosis. To explore new biomarkers of COVID-19 in this study, expression profiles were firstly accessed from the GEO database. On this basis, 500 feature genes were screened by the minimum-redundancy maximum-relevancy (mRMR) feature selection method. Afterwards, the incremental feature selection (IFS) method was used to choose a classifier with the best performance from different feature gene-based support vector machine (SVM) classifiers. The corresponding 66 feature genes were set as the optimal feature genes. Lastly, the optimal feature genes were subjected to GO functional enrichment analysis, principal component analysis (PCA), and protein-protein interaction (PPI) network analysis. All in all, it was posited that the 66 feature genes could effectively classify positive and negative COVID-19 and work as new biomarkers of the disease. |
url |
http://dx.doi.org/10.1155/2021/2203636 |
work_keys_str_mv |
AT shuaizhang identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies AT renliangqu identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies AT pengyanwang identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies AT shenghanwang identificationofnovelcovid19biomarkersbymultiplefeatureselectionstrategies |
_version_ |
1716829086488199168 |