Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV
Hepatitis C virus (HCV) is one of the most dangerous viruses worldwide. It is the foremost cause of the hepatic cirrhosis, and hepatocellular carcinoma, HCC. Detecting new key genes that play a role in the growth of HCC in HCV patients using machine learning techniques paves the way for producing ac...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-03-01
|
Series: | Algorithms |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-4893/13/3/73 |
id |
doaj-9d7a9ae7f7a74a978ea5bc65d3e64cb2 |
---|---|
record_format |
Article |
spelling |
doaj-9d7a9ae7f7a74a978ea5bc65d3e64cb22020-11-25T02:04:40ZengMDPI AGAlgorithms1999-48932020-03-011337310.3390/a13030073a13030073Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCVNagwan M. Abdel Samee0Information Technology Department, College of Computer & Information Sciences, Princess Nourah bint Abdulrahman University, 11671 Riyadh, Saudi ArabiaHepatitis C virus (HCV) is one of the most dangerous viruses worldwide. It is the foremost cause of the hepatic cirrhosis, and hepatocellular carcinoma, HCC. Detecting new key genes that play a role in the growth of HCC in HCV patients using machine learning techniques paves the way for producing accurate antivirals. In this work, there are two phases: detecting the up/downregulated genes using classical univariate and multivariate feature selection methods, and validating the retrieved list of genes using Insilico classifiers. However, the classification algorithms in the medical domain frequently suffer from a deficiency of training cases. Therefore, a deep neural network approach is proposed here to validate the significance of the retrieved genes in classifying the HCV-infected samples from the disinfected ones. The validation model is based on the artificial generation of new examples from the retrieved genes’ expressions using sparse autoencoders. Subsequently, the generated genes’ expressions data are used to train conventional classifiers. Our results in the first phase yielded a better retrieval of significant genes using Principal Component Analysis (PCA), a multivariate approach. The retrieved list of genes using PCA had a higher number of HCC biomarkers compared to the ones retrieved from the univariate methods. In the second phase, the classification accuracy can reveal the relevance of the extracted key genes in classifying the HCV-infected and disinfected samples.https://www.mdpi.com/1999-4893/13/3/73key geneshcchcvclassical machine learningdeep learningautoencoders |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Nagwan M. Abdel Samee |
spellingShingle |
Nagwan M. Abdel Samee Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV Algorithms key genes hcc hcv classical machine learning deep learning autoencoders |
author_facet |
Nagwan M. Abdel Samee |
author_sort |
Nagwan M. Abdel Samee |
title |
Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV |
title_short |
Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV |
title_full |
Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV |
title_fullStr |
Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV |
title_full_unstemmed |
Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV |
title_sort |
classical and deep learning paradigms for detection and validation of key genes of risky outcomes of hcv |
publisher |
MDPI AG |
series |
Algorithms |
issn |
1999-4893 |
publishDate |
2020-03-01 |
description |
Hepatitis C virus (HCV) is one of the most dangerous viruses worldwide. It is the foremost cause of the hepatic cirrhosis, and hepatocellular carcinoma, HCC. Detecting new key genes that play a role in the growth of HCC in HCV patients using machine learning techniques paves the way for producing accurate antivirals. In this work, there are two phases: detecting the up/downregulated genes using classical univariate and multivariate feature selection methods, and validating the retrieved list of genes using Insilico classifiers. However, the classification algorithms in the medical domain frequently suffer from a deficiency of training cases. Therefore, a deep neural network approach is proposed here to validate the significance of the retrieved genes in classifying the HCV-infected samples from the disinfected ones. The validation model is based on the artificial generation of new examples from the retrieved genes’ expressions using sparse autoencoders. Subsequently, the generated genes’ expressions data are used to train conventional classifiers. Our results in the first phase yielded a better retrieval of significant genes using Principal Component Analysis (PCA), a multivariate approach. The retrieved list of genes using PCA had a higher number of HCC biomarkers compared to the ones retrieved from the univariate methods. In the second phase, the classification accuracy can reveal the relevance of the extracted key genes in classifying the HCV-infected and disinfected samples. |
topic |
key genes hcc hcv classical machine learning deep learning autoencoders |
url |
https://www.mdpi.com/1999-4893/13/3/73 |
work_keys_str_mv |
AT nagwanmabdelsamee classicalanddeeplearningparadigmsfordetectionandvalidationofkeygenesofriskyoutcomesofhcv |
_version_ |
1724941837476560896 |