Dimension reduction methods for microarray data: a review
Dimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features) simultaneously at a molecular level with a very small...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
AIMS Press
2017-03-01
|
Series: | AIMS Bioengineering |
Subjects: | |
Online Access: | http://www.aimspress.com/Bioengineering/article/1315/fulltext.html |
id |
doaj-01b689ef5e8d451e986928a541fad595 |
---|---|
record_format |
Article |
spelling |
doaj-01b689ef5e8d451e986928a541fad5952020-11-24T21:47:27ZengAIMS PressAIMS Bioengineering2375-14952017-03-014117919710.3934/bioeng.2017.1.179bioeng-04-00179Dimension reduction methods for microarray data: a reviewRabia Aziz0C.K. Verma1Namita Srivastava2Department of Mathematics & Computer Application, Maulana Azad National Institute of Technology Bhopal-462003 (M.P.) IndiaDepartment of Mathematics & Computer Application, Maulana Azad National Institute of Technology Bhopal-462003 (M.P.) IndiaDepartment of Mathematics & Computer Application, Maulana Azad National Institute of Technology Bhopal-462003 (M.P.) IndiaDimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features) simultaneously at a molecular level with a very small number of samples. The copious numbers of genes are usually provided to a learning algorithm for producing a complete characterization of the classification task. However, most of the times the majority of the genes are irrelevant or redundant to the learning task. It will deteriorate the learning accuracy and training speed as well as lead to the problem of overfitting. Thus, dimension reduction of microarray data is a crucial preprocessing step for prediction and classification of disease. Various feature selection and feature extraction techniques have been proposed in the literature to identify the genes, that have direct impact on the various machine learning algorithms for classification and eliminate the remaining ones. This paper describes the taxonomy of dimension reduction methods with their characteristics, evaluation criteria, advantages and disadvantages. It also presents a review of numerous dimension reduction approaches for microarray data, mainly those methods that have been proposed over the past few years.http://www.aimspress.com/Bioengineering/article/1315/fulltext.htmlDNA microarraysdimension reductionclassificationprediction |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rabia Aziz C.K. Verma Namita Srivastava |
spellingShingle |
Rabia Aziz C.K. Verma Namita Srivastava Dimension reduction methods for microarray data: a review AIMS Bioengineering DNA microarrays dimension reduction classification prediction |
author_facet |
Rabia Aziz C.K. Verma Namita Srivastava |
author_sort |
Rabia Aziz |
title |
Dimension reduction methods for microarray data: a review |
title_short |
Dimension reduction methods for microarray data: a review |
title_full |
Dimension reduction methods for microarray data: a review |
title_fullStr |
Dimension reduction methods for microarray data: a review |
title_full_unstemmed |
Dimension reduction methods for microarray data: a review |
title_sort |
dimension reduction methods for microarray data: a review |
publisher |
AIMS Press |
series |
AIMS Bioengineering |
issn |
2375-1495 |
publishDate |
2017-03-01 |
description |
Dimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features) simultaneously at a molecular level with a very small number of samples. The copious numbers of genes are usually provided to a learning algorithm for producing a complete characterization of the classification task. However, most of the times the majority of the genes are irrelevant or redundant to the learning task. It will deteriorate the learning accuracy and training speed as well as lead to the problem of overfitting. Thus, dimension reduction of microarray data is a crucial preprocessing step for prediction and classification of disease. Various feature selection and feature extraction techniques have been proposed in the literature to identify the genes, that have direct impact on the various machine learning algorithms for classification and eliminate the remaining ones. This paper describes the taxonomy of dimension reduction methods with their characteristics, evaluation criteria, advantages and disadvantages. It also presents a review of numerous dimension reduction approaches for microarray data, mainly those methods that have been proposed over the past few years. |
topic |
DNA microarrays dimension reduction classification prediction |
url |
http://www.aimspress.com/Bioengineering/article/1315/fulltext.html |
work_keys_str_mv |
AT rabiaaziz dimensionreductionmethodsformicroarraydataareview AT ckverma dimensionreductionmethodsformicroarraydataareview AT namitasrivastava dimensionreductionmethodsformicroarraydataareview |
_version_ |
1725896789771419648 |