Improved Lasso (ILASSO) for Gene Selection and Classification in High Dimensional DNA Microarray Data

<p class="0abstract">Classification and selection of gene in high dimensional microarray data has become a challenging problem in molecular biology and genetics. Penalized Adaptive likelihood method has been employed recently for classification of cancer to address both gene selectio...

Full description

Bibliographic Details
Main Authors: Isah Aliyu Kargi, Norazlina Bint Ismail, Ismail Bin Mohamad
Format: Article
Language:English
Published: International Association of Online Engineering (IAOE) 2021-08-01
Series:International Journal of Online and Biomedical Engineering
Subjects:
Online Access:https://online-journals.org/index.php/i-joe/article/view/24601
id doaj-fee2029076df4f5c88b8da3a5c0c385f
record_format Article
spelling doaj-fee2029076df4f5c88b8da3a5c0c385f2021-08-16T14:24:55ZengInternational Association of Online Engineering (IAOE)International Journal of Online and Biomedical Engineering2626-84932021-08-0117089110210.3991/ijoe.v17i08.246018315Improved Lasso (ILASSO) for Gene Selection and Classification in High Dimensional DNA Microarray DataIsah Aliyu Kargi0Norazlina Bint Ismail1Ismail Bin Mohamad2Department of Mathematics, Faculty of Science, Universiti Teknologi Malaysia, 81310 UTM Skudai, Johor, Malaysia. Department of Mathematics and Statistics Nuhu Bamalli Polytechnic p.m.b 1061, Zaria.Norazlina Bint Ismail Department of Mathematics, Faculty of Science, Universiti Teknologi Malaysia, 81310 UTM Skudai, Johor, Malaysia.Department of Mathematics, Faculty of Science, Universiti Teknologi Malaysia, 81310 UTM Skudai, Johor, Malaysia.<p class="0abstract">Classification and selection of gene in high dimensional microarray data has become a challenging problem in molecular biology and genetics. Penalized Adaptive likelihood method has been employed recently for classification of cancer to address both gene selection consistency and estimation of gene coefficients in high dimensional data simultaneously. Many studies from the literature have proposed the use of ordinary least squares (OLS), maximum likelihood estimation (MLE) and Elastic net as the initial weight in the Adaptive elastic net, but in high dimensional microarray data the MLE and OLS are not suitable. Likewise, considering the Elastic net as the initial weight in Adaptive elastic yields a poor performance, because the ridge penalty in the Elastic net grouped coefficient of highly correlated genes closer to each other.  As a result, the estimator fails to differentiate coefficients of highly correlated genes that have different sign being grouped together. To tackle this issue, the present study proposed Improved LASSO (ILASSO) estimator which add the ridge penalty to the original LASSO with an Adaptive weight to both    and  simultaneously. Results from the real data indicated that ILASSO has a better performance compared to other methods in terms of the number of gene selected, classification precision, Sensitivity and Specificity.</p>https://online-journals.org/index.php/i-joe/article/view/24601high dimension data, penalized adaptive elastic net, ilasso, logistic regression, gene selection, cancer classification
collection DOAJ
language English
format Article
sources DOAJ
author Isah Aliyu Kargi
Norazlina Bint Ismail
Ismail Bin Mohamad
spellingShingle Isah Aliyu Kargi
Norazlina Bint Ismail
Ismail Bin Mohamad
Improved Lasso (ILASSO) for Gene Selection and Classification in High Dimensional DNA Microarray Data
International Journal of Online and Biomedical Engineering
high dimension data, penalized adaptive elastic net, ilasso, logistic regression, gene selection, cancer classification
author_facet Isah Aliyu Kargi
Norazlina Bint Ismail
Ismail Bin Mohamad
author_sort Isah Aliyu Kargi
title Improved Lasso (ILASSO) for Gene Selection and Classification in High Dimensional DNA Microarray Data
title_short Improved Lasso (ILASSO) for Gene Selection and Classification in High Dimensional DNA Microarray Data
title_full Improved Lasso (ILASSO) for Gene Selection and Classification in High Dimensional DNA Microarray Data
title_fullStr Improved Lasso (ILASSO) for Gene Selection and Classification in High Dimensional DNA Microarray Data
title_full_unstemmed Improved Lasso (ILASSO) for Gene Selection and Classification in High Dimensional DNA Microarray Data
title_sort improved lasso (ilasso) for gene selection and classification in high dimensional dna microarray data
publisher International Association of Online Engineering (IAOE)
series International Journal of Online and Biomedical Engineering
issn 2626-8493
publishDate 2021-08-01
description <p class="0abstract">Classification and selection of gene in high dimensional microarray data has become a challenging problem in molecular biology and genetics. Penalized Adaptive likelihood method has been employed recently for classification of cancer to address both gene selection consistency and estimation of gene coefficients in high dimensional data simultaneously. Many studies from the literature have proposed the use of ordinary least squares (OLS), maximum likelihood estimation (MLE) and Elastic net as the initial weight in the Adaptive elastic net, but in high dimensional microarray data the MLE and OLS are not suitable. Likewise, considering the Elastic net as the initial weight in Adaptive elastic yields a poor performance, because the ridge penalty in the Elastic net grouped coefficient of highly correlated genes closer to each other.  As a result, the estimator fails to differentiate coefficients of highly correlated genes that have different sign being grouped together. To tackle this issue, the present study proposed Improved LASSO (ILASSO) estimator which add the ridge penalty to the original LASSO with an Adaptive weight to both    and  simultaneously. Results from the real data indicated that ILASSO has a better performance compared to other methods in terms of the number of gene selected, classification precision, Sensitivity and Specificity.</p>
topic high dimension data, penalized adaptive elastic net, ilasso, logistic regression, gene selection, cancer classification
url https://online-journals.org/index.php/i-joe/article/view/24601
work_keys_str_mv AT isahaliyukargi improvedlassoilassoforgeneselectionandclassificationinhighdimensionaldnamicroarraydata
AT norazlinabintismail improvedlassoilassoforgeneselectionandclassificationinhighdimensionaldnamicroarraydata
AT ismailbinmohamad improvedlassoilassoforgeneselectionandclassificationinhighdimensionaldnamicroarraydata
_version_ 1721205831608827904