Evaluation of Different Classification Models to Extract Gene Signatures for Breast Cancer Recurrence Using Microarray Data

Background: In this study, we aimed to improve the reliability and biological interpretability of gene signatures selected from microarrays by efficient usage of computational models and mathematical algorithms. Methods: At the first step, a good model with high accuracy was chosen to predict cance...

Full description

Bibliographic Details
Main Authors: Mohammadreza Sehhati, Mina Kayed
Format: Article
Language:fas
Published: Vesnu Publications 2017-04-01
Series:مجله دانشکده پزشکی اصفهان
Subjects:
Online Access:http://jims.mui.ac.ir/index.php/jims/article/view/7469
id doaj-05a30274d2574a20a6fff6fed1a11e00
record_format Article
spelling doaj-05a30274d2574a20a6fff6fed1a11e002020-11-25T01:12:53ZfasVesnu Publications مجله دانشکده پزشکی اصفهان1027-75951735-854X2017-04-0111981032441Evaluation of Different Classification Models to Extract Gene Signatures for Breast Cancer Recurrence Using Microarray DataMohammadreza Sehhati0Mina Kayed1Assistant Professor, Department of Bioelectric and Biomedical Engineering, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, IranDepartment of Electrical Engineering, Sepahan Institute of Higher Education, Isfahan, IranBackground: In this study, we aimed to improve the reliability and biological interpretability of gene signatures selected from microarrays by efficient usage of computational models and mathematical algorithms. Methods: At the first step, a good model with high accuracy was chosen to predict cancer recurrence in microarray gene expression data on breast tumors. In this regard, microarray gene expression data of breast tumor in 1271 cancer patients (379 with recurrence and 892 people without recurrence) were utilized to construct an appropriate predictive model for recurrence by comparing the performance of multiple classifiers. In the pre-processing stage, different methods like correlation-based feature selection (CFS), principal component analysis (PCA), independent component analysis (ICA), and genetic algorithm as well as a random selection method were used to reduce the dimensions and choose the most appropriate genes (features). Findings: A total of five gene signatures were selected by combining genetic algorithm, top scoring set (TSS), and random selection method, which showed the best results in most classification models. The final indicator genes were TRIP13, KIF20A, NEK2, RACGAP1 and TYMS, which had significant contribution in the structure of microtubules and spindle and also regulated the attachment of spindle microtubules to kinetochore. Conclusion: By using hybrid models, we can avoid overfitting in training and achieve acceptable accuracy with biologically interpretable genes.http://jims.mui.ac.ir/index.php/jims/article/view/7469AlgorithmsBiomarkersBreast cancerClassificationGene expression profiling
collection DOAJ
language fas
format Article
sources DOAJ
author Mohammadreza Sehhati
Mina Kayed
spellingShingle Mohammadreza Sehhati
Mina Kayed
Evaluation of Different Classification Models to Extract Gene Signatures for Breast Cancer Recurrence Using Microarray Data
مجله دانشکده پزشکی اصفهان
Algorithms
Biomarkers
Breast cancer
Classification
Gene expression profiling
author_facet Mohammadreza Sehhati
Mina Kayed
author_sort Mohammadreza Sehhati
title Evaluation of Different Classification Models to Extract Gene Signatures for Breast Cancer Recurrence Using Microarray Data
title_short Evaluation of Different Classification Models to Extract Gene Signatures for Breast Cancer Recurrence Using Microarray Data
title_full Evaluation of Different Classification Models to Extract Gene Signatures for Breast Cancer Recurrence Using Microarray Data
title_fullStr Evaluation of Different Classification Models to Extract Gene Signatures for Breast Cancer Recurrence Using Microarray Data
title_full_unstemmed Evaluation of Different Classification Models to Extract Gene Signatures for Breast Cancer Recurrence Using Microarray Data
title_sort evaluation of different classification models to extract gene signatures for breast cancer recurrence using microarray data
publisher Vesnu Publications
series مجله دانشکده پزشکی اصفهان
issn 1027-7595
1735-854X
publishDate 2017-04-01
description Background: In this study, we aimed to improve the reliability and biological interpretability of gene signatures selected from microarrays by efficient usage of computational models and mathematical algorithms. Methods: At the first step, a good model with high accuracy was chosen to predict cancer recurrence in microarray gene expression data on breast tumors. In this regard, microarray gene expression data of breast tumor in 1271 cancer patients (379 with recurrence and 892 people without recurrence) were utilized to construct an appropriate predictive model for recurrence by comparing the performance of multiple classifiers. In the pre-processing stage, different methods like correlation-based feature selection (CFS), principal component analysis (PCA), independent component analysis (ICA), and genetic algorithm as well as a random selection method were used to reduce the dimensions and choose the most appropriate genes (features). Findings: A total of five gene signatures were selected by combining genetic algorithm, top scoring set (TSS), and random selection method, which showed the best results in most classification models. The final indicator genes were TRIP13, KIF20A, NEK2, RACGAP1 and TYMS, which had significant contribution in the structure of microtubules and spindle and also regulated the attachment of spindle microtubules to kinetochore. Conclusion: By using hybrid models, we can avoid overfitting in training and achieve acceptable accuracy with biologically interpretable genes.
topic Algorithms
Biomarkers
Breast cancer
Classification
Gene expression profiling
url http://jims.mui.ac.ir/index.php/jims/article/view/7469
work_keys_str_mv AT mohammadrezasehhati evaluationofdifferentclassificationmodelstoextractgenesignaturesforbreastcancerrecurrenceusingmicroarraydata
AT minakayed evaluationofdifferentclassificationmodelstoextractgenesignaturesforbreastcancerrecurrenceusingmicroarraydata
_version_ 1725164479772098560