Classification of Masses in Digital Mammograms Using the Genetic Ensemble Method

All over the world, breast cancer is the second leading cause of death in women above 40 years of age. To design an efficient classification system for breast cancer diagnosis, one has to use efficient algorithms for feature selection to reduce the feature space of mammogram classification. The curr...

Full description

Bibliographic Details
Main Authors: Thawkar Shankar, Ingolikar Ranjana
Format: Article
Language:English
Published: De Gruyter 2018-07-01
Series:Journal of Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1515/jisys-2018-0091
id doaj-81ea4bdcff9747eab867fa573a5307d5
record_format Article
spelling doaj-81ea4bdcff9747eab867fa573a5307d52021-09-06T19:40:38ZengDe GruyterJournal of Intelligent Systems0334-18602191-026X2018-07-0129183184510.1515/jisys-2018-0091Classification of Masses in Digital Mammograms Using the Genetic Ensemble MethodThawkar Shankar0Ingolikar Ranjana1Electronics and Computer Science, Rashtrasant Tukadoji Maharaj Nagpur University, Nagpur, Maharashtra, IndiaDepartment of Computer Science, S. F. S College, Nagpur, Maharashtra, IndiaAll over the world, breast cancer is the second leading cause of death in women above 40 years of age. To design an efficient classification system for breast cancer diagnosis, one has to use efficient algorithms for feature selection to reduce the feature space of mammogram classification. The current work investigates the use of hybrid genetic ensemble method for feature selection and classification of masses. Genetic algorithm (GA) is used to select a subset of features and to evaluate the fitness of the selected features, Adaptive boosting (AdaBoost) and Random Forest (RF) ensembles with 10-fold cross-validation are employed. The selected features are used to classify masses into benign or malignant using AdaBoost, RF, and single Decision Tree (DT) classifiers. The performance evaluation of classifiers indicates that AdaBoost outperforms both RF and single DT classifiers. AdaBoost achieves an accuracy of 96.15%, with 97.32% sensitivity, 95.90% specificity, and area under curve of AZ = 0.982 ± 0.004. The results obtained with the proposed method are better when compared with extant research work.https://doi.org/10.1515/jisys-2018-0091digital mammographydecision treefeature selectionclassificationgenetic algorithmensemblesadaboostrandom forestreceiver operating characteristics curve
collection DOAJ
language English
format Article
sources DOAJ
author Thawkar Shankar
Ingolikar Ranjana
spellingShingle Thawkar Shankar
Ingolikar Ranjana
Classification of Masses in Digital Mammograms Using the Genetic Ensemble Method
Journal of Intelligent Systems
digital mammography
decision tree
feature selection
classification
genetic algorithm
ensembles
adaboost
random forest
receiver operating characteristics curve
author_facet Thawkar Shankar
Ingolikar Ranjana
author_sort Thawkar Shankar
title Classification of Masses in Digital Mammograms Using the Genetic Ensemble Method
title_short Classification of Masses in Digital Mammograms Using the Genetic Ensemble Method
title_full Classification of Masses in Digital Mammograms Using the Genetic Ensemble Method
title_fullStr Classification of Masses in Digital Mammograms Using the Genetic Ensemble Method
title_full_unstemmed Classification of Masses in Digital Mammograms Using the Genetic Ensemble Method
title_sort classification of masses in digital mammograms using the genetic ensemble method
publisher De Gruyter
series Journal of Intelligent Systems
issn 0334-1860
2191-026X
publishDate 2018-07-01
description All over the world, breast cancer is the second leading cause of death in women above 40 years of age. To design an efficient classification system for breast cancer diagnosis, one has to use efficient algorithms for feature selection to reduce the feature space of mammogram classification. The current work investigates the use of hybrid genetic ensemble method for feature selection and classification of masses. Genetic algorithm (GA) is used to select a subset of features and to evaluate the fitness of the selected features, Adaptive boosting (AdaBoost) and Random Forest (RF) ensembles with 10-fold cross-validation are employed. The selected features are used to classify masses into benign or malignant using AdaBoost, RF, and single Decision Tree (DT) classifiers. The performance evaluation of classifiers indicates that AdaBoost outperforms both RF and single DT classifiers. AdaBoost achieves an accuracy of 96.15%, with 97.32% sensitivity, 95.90% specificity, and area under curve of AZ = 0.982 ± 0.004. The results obtained with the proposed method are better when compared with extant research work.
topic digital mammography
decision tree
feature selection
classification
genetic algorithm
ensembles
adaboost
random forest
receiver operating characteristics curve
url https://doi.org/10.1515/jisys-2018-0091
work_keys_str_mv AT thawkarshankar classificationofmassesindigitalmammogramsusingthegeneticensemblemethod
AT ingolikarranjana classificationofmassesindigitalmammogramsusingthegeneticensemblemethod
_version_ 1717768024844402688