Dimension reduction methods for microarray data: a review

Dimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features) simultaneously at a molecular level with a very small...

Full description

Bibliographic Details
Main Authors: Rabia Aziz, C.K. Verma, Namita Srivastava
Format: Article
Language:English
Published: AIMS Press 2017-03-01
Series:AIMS Bioengineering
Subjects:
Online Access:http://www.aimspress.com/Bioengineering/article/1315/fulltext.html
id doaj-01b689ef5e8d451e986928a541fad595
record_format Article
spelling doaj-01b689ef5e8d451e986928a541fad5952020-11-24T21:47:27ZengAIMS PressAIMS Bioengineering2375-14952017-03-014117919710.3934/bioeng.2017.1.179bioeng-04-00179Dimension reduction methods for microarray data: a reviewRabia Aziz0C.K. Verma1Namita Srivastava2Department of Mathematics & Computer Application, Maulana Azad National Institute of Technology Bhopal-462003 (M.P.) IndiaDepartment of Mathematics & Computer Application, Maulana Azad National Institute of Technology Bhopal-462003 (M.P.) IndiaDepartment of Mathematics & Computer Application, Maulana Azad National Institute of Technology Bhopal-462003 (M.P.) IndiaDimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features) simultaneously at a molecular level with a very small number of samples. The copious numbers of genes are usually provided to a learning algorithm for producing a complete characterization of the classification task. However, most of the times the majority of the genes are irrelevant or redundant to the learning task. It will deteriorate the learning accuracy and training speed as well as lead to the problem of overfitting. Thus, dimension reduction of microarray data is a crucial preprocessing step for prediction and classification of disease. Various feature selection and feature extraction techniques have been proposed in the literature to identify the genes, that have direct impact on the various machine learning algorithms for classification and eliminate the remaining ones. This paper describes the taxonomy of dimension reduction methods with their characteristics, evaluation criteria, advantages and disadvantages. It also presents a review of numerous dimension reduction approaches for microarray data, mainly those methods that have been proposed over the past few years.http://www.aimspress.com/Bioengineering/article/1315/fulltext.htmlDNA microarraysdimension reductionclassificationprediction
collection DOAJ
language English
format Article
sources DOAJ
author Rabia Aziz
C.K. Verma
Namita Srivastava
spellingShingle Rabia Aziz
C.K. Verma
Namita Srivastava
Dimension reduction methods for microarray data: a review
AIMS Bioengineering
DNA microarrays
dimension reduction
classification
prediction
author_facet Rabia Aziz
C.K. Verma
Namita Srivastava
author_sort Rabia Aziz
title Dimension reduction methods for microarray data: a review
title_short Dimension reduction methods for microarray data: a review
title_full Dimension reduction methods for microarray data: a review
title_fullStr Dimension reduction methods for microarray data: a review
title_full_unstemmed Dimension reduction methods for microarray data: a review
title_sort dimension reduction methods for microarray data: a review
publisher AIMS Press
series AIMS Bioengineering
issn 2375-1495
publishDate 2017-03-01
description Dimension reduction has become inevitable for pre-processing of high dimensional data. “Gene expression microarray data” is an instance of such high dimensional data. Gene expression microarray data displays the maximum number of genes (features) simultaneously at a molecular level with a very small number of samples. The copious numbers of genes are usually provided to a learning algorithm for producing a complete characterization of the classification task. However, most of the times the majority of the genes are irrelevant or redundant to the learning task. It will deteriorate the learning accuracy and training speed as well as lead to the problem of overfitting. Thus, dimension reduction of microarray data is a crucial preprocessing step for prediction and classification of disease. Various feature selection and feature extraction techniques have been proposed in the literature to identify the genes, that have direct impact on the various machine learning algorithms for classification and eliminate the remaining ones. This paper describes the taxonomy of dimension reduction methods with their characteristics, evaluation criteria, advantages and disadvantages. It also presents a review of numerous dimension reduction approaches for microarray data, mainly those methods that have been proposed over the past few years.
topic DNA microarrays
dimension reduction
classification
prediction
url http://www.aimspress.com/Bioengineering/article/1315/fulltext.html
work_keys_str_mv AT rabiaaziz dimensionreductionmethodsformicroarraydataareview
AT ckverma dimensionreductionmethodsformicroarraydataareview
AT namitasrivastava dimensionreductionmethodsformicroarraydataareview
_version_ 1725896789771419648