A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO.

The purpose of feature selection is to identify the relevant and non-redundant features from a dataset. In this article, the feature selection problem is organized as a graph-theoretic problem where a feature-dissimilarity graph is shaped from the data matrix. The nodes represent features and the ed...

Full description

Bibliographic Details
Main Authors: Monalisa Mandal, Anirban Mukhopadhyay
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3953335?pdf=render
id doaj-799e99cf200d4e4b997d5cd347a94712
record_format Article
spelling doaj-799e99cf200d4e4b997d5cd347a947122020-11-24T21:27:10ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0193e9094910.1371/journal.pone.0090949A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO.Monalisa MandalAnirban MukhopadhyayThe purpose of feature selection is to identify the relevant and non-redundant features from a dataset. In this article, the feature selection problem is organized as a graph-theoretic problem where a feature-dissimilarity graph is shaped from the data matrix. The nodes represent features and the edges represent their dissimilarity. Both nodes and edges are given weight according to the feature's relevance and dissimilarity among the features, respectively. The problem of finding relevant and non-redundant features is then mapped into densest subgraph finding problem. We have proposed a multiobjective particle swarm optimization (PSO)-based algorithm that optimizes average node-weight and average edge-weight of the candidate subgraph simultaneously. The proposed algorithm is applied for identifying relevant and non-redundant disease-related genes from microarray gene expression data. The performance of the proposed method is compared with that of several other existing feature selection techniques on different real-life microarray gene expression datasets.http://europepmc.org/articles/PMC3953335?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Monalisa Mandal
Anirban Mukhopadhyay
spellingShingle Monalisa Mandal
Anirban Mukhopadhyay
A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO.
PLoS ONE
author_facet Monalisa Mandal
Anirban Mukhopadhyay
author_sort Monalisa Mandal
title A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO.
title_short A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO.
title_full A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO.
title_fullStr A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO.
title_full_unstemmed A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO.
title_sort graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary pso.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description The purpose of feature selection is to identify the relevant and non-redundant features from a dataset. In this article, the feature selection problem is organized as a graph-theoretic problem where a feature-dissimilarity graph is shaped from the data matrix. The nodes represent features and the edges represent their dissimilarity. Both nodes and edges are given weight according to the feature's relevance and dissimilarity among the features, respectively. The problem of finding relevant and non-redundant features is then mapped into densest subgraph finding problem. We have proposed a multiobjective particle swarm optimization (PSO)-based algorithm that optimizes average node-weight and average edge-weight of the candidate subgraph simultaneously. The proposed algorithm is applied for identifying relevant and non-redundant disease-related genes from microarray gene expression data. The performance of the proposed method is compared with that of several other existing feature selection techniques on different real-life microarray gene expression datasets.
url http://europepmc.org/articles/PMC3953335?pdf=render
work_keys_str_mv AT monalisamandal agraphtheoreticapproachforidentifyingnonredundantandrelevantgenemarkersfrommicroarraydatausingmultiobjectivebinarypso
AT anirbanmukhopadhyay agraphtheoreticapproachforidentifyingnonredundantandrelevantgenemarkersfrommicroarraydatausingmultiobjectivebinarypso
AT monalisamandal graphtheoreticapproachforidentifyingnonredundantandrelevantgenemarkersfrommicroarraydatausingmultiobjectivebinarypso
AT anirbanmukhopadhyay graphtheoreticapproachforidentifyingnonredundantandrelevantgenemarkersfrommicroarraydatausingmultiobjectivebinarypso
_version_ 1725976343727833088