Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.

In cancer biology, it is very important to understand the phenotypic changes of the patients and discover new cancer subtypes. Recently, microarray-based technologies have shed light on this problem based on gene expression profiles which may contain outliers due to either chemical or electrical rea...

Full description

Bibliographic Details
Main Authors: Meng-Yun Wu, Dao-Qing Dai, Xiao-Fei Zhang, Yuan Zhu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3684607?pdf=render
id doaj-5a5817b5756247998ca5750defb4bdc7
record_format Article
spelling doaj-5a5817b5756247998ca5750defb4bdc72020-11-24T21:50:58ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0186e6625610.1371/journal.pone.0066256Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.Meng-Yun WuDao-Qing DaiXiao-Fei ZhangYuan ZhuIn cancer biology, it is very important to understand the phenotypic changes of the patients and discover new cancer subtypes. Recently, microarray-based technologies have shed light on this problem based on gene expression profiles which may contain outliers due to either chemical or electrical reasons. These undiscovered subtypes may be heterogeneous with respect to underlying networks or pathways, and are related with only a few of interdependent biomarkers. This motivates a need for the robust gene expression-based methods capable of discovering such subtypes, elucidating the corresponding network structures and identifying cancer related biomarkers. This study proposes a penalized model-based Student's t clustering with unconstrained covariance (PMT-UC) to discover cancer subtypes with cluster-specific networks, taking gene dependencies into account and having robustness against outliers. Meanwhile, biomarker identification and network reconstruction are achieved by imposing an adaptive [Formula: see text] penalty on the means and the inverse scale matrices. The model is fitted via the expectation maximization algorithm utilizing the graphical lasso. Here, a network-based gene selection criterion that identifies biomarkers not as individual genes but as subnetworks is applied. This allows us to implicate low discriminative biomarkers which play a central role in the subnetwork by interconnecting many differentially expressed genes, or have cluster-specific underlying network structures. Experiment results on simulated datasets and one available cancer dataset attest to the effectiveness, robustness of PMT-UC in cancer subtype discovering. Moveover, PMT-UC has the ability to select cancer related biomarkers which have been verified in biochemical or biomedical research and learn the biological significant correlation among genes.http://europepmc.org/articles/PMC3684607?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Meng-Yun Wu
Dao-Qing Dai
Xiao-Fei Zhang
Yuan Zhu
spellingShingle Meng-Yun Wu
Dao-Qing Dai
Xiao-Fei Zhang
Yuan Zhu
Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.
PLoS ONE
author_facet Meng-Yun Wu
Dao-Qing Dai
Xiao-Fei Zhang
Yuan Zhu
author_sort Meng-Yun Wu
title Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.
title_short Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.
title_full Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.
title_fullStr Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.
title_full_unstemmed Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.
title_sort cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description In cancer biology, it is very important to understand the phenotypic changes of the patients and discover new cancer subtypes. Recently, microarray-based technologies have shed light on this problem based on gene expression profiles which may contain outliers due to either chemical or electrical reasons. These undiscovered subtypes may be heterogeneous with respect to underlying networks or pathways, and are related with only a few of interdependent biomarkers. This motivates a need for the robust gene expression-based methods capable of discovering such subtypes, elucidating the corresponding network structures and identifying cancer related biomarkers. This study proposes a penalized model-based Student's t clustering with unconstrained covariance (PMT-UC) to discover cancer subtypes with cluster-specific networks, taking gene dependencies into account and having robustness against outliers. Meanwhile, biomarker identification and network reconstruction are achieved by imposing an adaptive [Formula: see text] penalty on the means and the inverse scale matrices. The model is fitted via the expectation maximization algorithm utilizing the graphical lasso. Here, a network-based gene selection criterion that identifies biomarkers not as individual genes but as subnetworks is applied. This allows us to implicate low discriminative biomarkers which play a central role in the subnetwork by interconnecting many differentially expressed genes, or have cluster-specific underlying network structures. Experiment results on simulated datasets and one available cancer dataset attest to the effectiveness, robustness of PMT-UC in cancer subtype discovering. Moveover, PMT-UC has the ability to select cancer related biomarkers which have been verified in biochemical or biomedical research and learn the biological significant correlation among genes.
url http://europepmc.org/articles/PMC3684607?pdf=render
work_keys_str_mv AT mengyunwu cancersubtypediscoveryandbiomarkeridentificationviaanewrobustnetworkclusteringalgorithm
AT daoqingdai cancersubtypediscoveryandbiomarkeridentificationviaanewrobustnetworkclusteringalgorithm
AT xiaofeizhang cancersubtypediscoveryandbiomarkeridentificationviaanewrobustnetworkclusteringalgorithm
AT yuanzhu cancersubtypediscoveryandbiomarkeridentificationviaanewrobustnetworkclusteringalgorithm
_version_ 1725881315092332544