Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV

Hepatitis C virus (HCV) is one of the most dangerous viruses worldwide. It is the foremost cause of the hepatic cirrhosis, and hepatocellular carcinoma, HCC. Detecting new key genes that play a role in the growth of HCC in HCV patients using machine learning techniques paves the way for producing ac...

Full description

Bibliographic Details
Main Author: Nagwan M. Abdel Samee
Format: Article
Language:English
Published: MDPI AG 2020-03-01
Series:Algorithms
Subjects:
hcc
hcv
Online Access:https://www.mdpi.com/1999-4893/13/3/73
id doaj-9d7a9ae7f7a74a978ea5bc65d3e64cb2
record_format Article
spelling doaj-9d7a9ae7f7a74a978ea5bc65d3e64cb22020-11-25T02:04:40ZengMDPI AGAlgorithms1999-48932020-03-011337310.3390/a13030073a13030073Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCVNagwan M. Abdel Samee0Information Technology Department, College of Computer & Information Sciences, Princess Nourah bint Abdulrahman University, 11671 Riyadh, Saudi ArabiaHepatitis C virus (HCV) is one of the most dangerous viruses worldwide. It is the foremost cause of the hepatic cirrhosis, and hepatocellular carcinoma, HCC. Detecting new key genes that play a role in the growth of HCC in HCV patients using machine learning techniques paves the way for producing accurate antivirals. In this work, there are two phases: detecting the up/downregulated genes using classical univariate and multivariate feature selection methods, and validating the retrieved list of genes using Insilico classifiers. However, the classification algorithms in the medical domain frequently suffer from a deficiency of training cases. Therefore, a deep neural network approach is proposed here to validate the significance of the retrieved genes in classifying the HCV-infected samples from the disinfected ones. The validation model is based on the artificial generation of new examples from the retrieved genes’ expressions using sparse autoencoders. Subsequently, the generated genes’ expressions data are used to train conventional classifiers. Our results in the first phase yielded a better retrieval of significant genes using Principal Component Analysis (PCA), a multivariate approach. The retrieved list of genes using PCA had a higher number of HCC biomarkers compared to the ones retrieved from the univariate methods. In the second phase, the classification accuracy can reveal the relevance of the extracted key genes in classifying the HCV-infected and disinfected samples.https://www.mdpi.com/1999-4893/13/3/73key geneshcchcvclassical machine learningdeep learningautoencoders
collection DOAJ
language English
format Article
sources DOAJ
author Nagwan M. Abdel Samee
spellingShingle Nagwan M. Abdel Samee
Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV
Algorithms
key genes
hcc
hcv
classical machine learning
deep learning
autoencoders
author_facet Nagwan M. Abdel Samee
author_sort Nagwan M. Abdel Samee
title Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV
title_short Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV
title_full Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV
title_fullStr Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV
title_full_unstemmed Classical and Deep Learning Paradigms for Detection and Validation of Key Genes of Risky Outcomes of HCV
title_sort classical and deep learning paradigms for detection and validation of key genes of risky outcomes of hcv
publisher MDPI AG
series Algorithms
issn 1999-4893
publishDate 2020-03-01
description Hepatitis C virus (HCV) is one of the most dangerous viruses worldwide. It is the foremost cause of the hepatic cirrhosis, and hepatocellular carcinoma, HCC. Detecting new key genes that play a role in the growth of HCC in HCV patients using machine learning techniques paves the way for producing accurate antivirals. In this work, there are two phases: detecting the up/downregulated genes using classical univariate and multivariate feature selection methods, and validating the retrieved list of genes using Insilico classifiers. However, the classification algorithms in the medical domain frequently suffer from a deficiency of training cases. Therefore, a deep neural network approach is proposed here to validate the significance of the retrieved genes in classifying the HCV-infected samples from the disinfected ones. The validation model is based on the artificial generation of new examples from the retrieved genes’ expressions using sparse autoencoders. Subsequently, the generated genes’ expressions data are used to train conventional classifiers. Our results in the first phase yielded a better retrieval of significant genes using Principal Component Analysis (PCA), a multivariate approach. The retrieved list of genes using PCA had a higher number of HCC biomarkers compared to the ones retrieved from the univariate methods. In the second phase, the classification accuracy can reveal the relevance of the extracted key genes in classifying the HCV-infected and disinfected samples.
topic key genes
hcc
hcv
classical machine learning
deep learning
autoencoders
url https://www.mdpi.com/1999-4893/13/3/73
work_keys_str_mv AT nagwanmabdelsamee classicalanddeeplearningparadigmsfordetectionandvalidationofkeygenesofriskyoutcomesofhcv
_version_ 1724941837476560896