Prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression data

Abstract Background Finding the tumor location in the prostate is an essential pathological step for prostate cancer diagnosis and treatment. The location of the tumor – the laterality – can be unilateral (the tumor is affecting one side of the prostate), or bilateral on both sides. Nevertheless, th...

Full description

Bibliographic Details
Main Authors: Osama Hamzeh, Abedalrhman Alkhateeb, Julia Zheng, Srinath Kandalam, Luis Rueda
Format: Article
Language:English
Published: BMC 2020-03-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-020-3345-9
id doaj-f0d8c9a5d9dd4b2a909200a6f553877b
record_format Article
spelling doaj-f0d8c9a5d9dd4b2a909200a6f553877b2020-11-25T03:33:09ZengBMCBMC Bioinformatics1471-21052020-03-0121S211010.1186/s12859-020-3345-9Prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression dataOsama Hamzeh0Abedalrhman Alkhateeb1Julia Zheng2Srinath Kandalam3Luis Rueda4School of Computer Science, University of WindsorSchool of Computer Science, University of WindsorSchool of Computer Science, University of WindsorDepartment of Biomedical Sciences, University of WindsorSchool of Computer Science, University of WindsorAbstract Background Finding the tumor location in the prostate is an essential pathological step for prostate cancer diagnosis and treatment. The location of the tumor – the laterality – can be unilateral (the tumor is affecting one side of the prostate), or bilateral on both sides. Nevertheless, the tumor can be overestimated or underestimated by standard screening methods. In this work, a combination of efficient machine learning methods for feature selection and classification are proposed to analyze gene activity and select them as relevant biomarkers for different laterality samples. Results A data set that consists of 450 samples was used in this study. The samples were divided into three laterality classes (left, right, bilateral). The aim of this work is to understand the genomic activity in each class and find relevant genes as indicators for each class with nearly 99% accuracy. The system identified groups of differentially expressed genes (RTN1, HLA-DMB, MRI1) that are able to differentiate samples among the three classes. Conclusion The proposed method was able to detect sets of genes that can identify different laterality classes. The resulting genes are found to be strongly correlated with disease progression. HLA-DMB and EIF4G2, which are detected in the set of genes can detect the left laterality, were reported earlier to be in the same pathway called Allograft rejection SuperPath.http://link.springer.com/article/10.1186/s12859-020-3345-9Machine learningClassificationBiomarkersProstate cancer laterality
collection DOAJ
language English
format Article
sources DOAJ
author Osama Hamzeh
Abedalrhman Alkhateeb
Julia Zheng
Srinath Kandalam
Luis Rueda
spellingShingle Osama Hamzeh
Abedalrhman Alkhateeb
Julia Zheng
Srinath Kandalam
Luis Rueda
Prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression data
BMC Bioinformatics
Machine learning
Classification
Biomarkers
Prostate cancer laterality
author_facet Osama Hamzeh
Abedalrhman Alkhateeb
Julia Zheng
Srinath Kandalam
Luis Rueda
author_sort Osama Hamzeh
title Prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression data
title_short Prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression data
title_full Prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression data
title_fullStr Prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression data
title_full_unstemmed Prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression data
title_sort prediction of tumor location in prostate cancer tissue using a machine learning system on gene expression data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2020-03-01
description Abstract Background Finding the tumor location in the prostate is an essential pathological step for prostate cancer diagnosis and treatment. The location of the tumor – the laterality – can be unilateral (the tumor is affecting one side of the prostate), or bilateral on both sides. Nevertheless, the tumor can be overestimated or underestimated by standard screening methods. In this work, a combination of efficient machine learning methods for feature selection and classification are proposed to analyze gene activity and select them as relevant biomarkers for different laterality samples. Results A data set that consists of 450 samples was used in this study. The samples were divided into three laterality classes (left, right, bilateral). The aim of this work is to understand the genomic activity in each class and find relevant genes as indicators for each class with nearly 99% accuracy. The system identified groups of differentially expressed genes (RTN1, HLA-DMB, MRI1) that are able to differentiate samples among the three classes. Conclusion The proposed method was able to detect sets of genes that can identify different laterality classes. The resulting genes are found to be strongly correlated with disease progression. HLA-DMB and EIF4G2, which are detected in the set of genes can detect the left laterality, were reported earlier to be in the same pathway called Allograft rejection SuperPath.
topic Machine learning
Classification
Biomarkers
Prostate cancer laterality
url http://link.springer.com/article/10.1186/s12859-020-3345-9
work_keys_str_mv AT osamahamzeh predictionoftumorlocationinprostatecancertissueusingamachinelearningsystemongeneexpressiondata
AT abedalrhmanalkhateeb predictionoftumorlocationinprostatecancertissueusingamachinelearningsystemongeneexpressiondata
AT juliazheng predictionoftumorlocationinprostatecancertissueusingamachinelearningsystemongeneexpressiondata
AT srinathkandalam predictionoftumorlocationinprostatecancertissueusingamachinelearningsystemongeneexpressiondata
AT luisrueda predictionoftumorlocationinprostatecancertissueusingamachinelearningsystemongeneexpressiondata
_version_ 1724564362460397568