Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.
In cancer biology, it is very important to understand the phenotypic changes of the patients and discover new cancer subtypes. Recently, microarray-based technologies have shed light on this problem based on gene expression profiles which may contain outliers due to either chemical or electrical rea...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2013-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3684607?pdf=render |
id |
doaj-5a5817b5756247998ca5750defb4bdc7 |
---|---|
record_format |
Article |
spelling |
doaj-5a5817b5756247998ca5750defb4bdc72020-11-24T21:50:58ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0186e6625610.1371/journal.pone.0066256Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm.Meng-Yun WuDao-Qing DaiXiao-Fei ZhangYuan ZhuIn cancer biology, it is very important to understand the phenotypic changes of the patients and discover new cancer subtypes. Recently, microarray-based technologies have shed light on this problem based on gene expression profiles which may contain outliers due to either chemical or electrical reasons. These undiscovered subtypes may be heterogeneous with respect to underlying networks or pathways, and are related with only a few of interdependent biomarkers. This motivates a need for the robust gene expression-based methods capable of discovering such subtypes, elucidating the corresponding network structures and identifying cancer related biomarkers. This study proposes a penalized model-based Student's t clustering with unconstrained covariance (PMT-UC) to discover cancer subtypes with cluster-specific networks, taking gene dependencies into account and having robustness against outliers. Meanwhile, biomarker identification and network reconstruction are achieved by imposing an adaptive [Formula: see text] penalty on the means and the inverse scale matrices. The model is fitted via the expectation maximization algorithm utilizing the graphical lasso. Here, a network-based gene selection criterion that identifies biomarkers not as individual genes but as subnetworks is applied. This allows us to implicate low discriminative biomarkers which play a central role in the subnetwork by interconnecting many differentially expressed genes, or have cluster-specific underlying network structures. Experiment results on simulated datasets and one available cancer dataset attest to the effectiveness, robustness of PMT-UC in cancer subtype discovering. Moveover, PMT-UC has the ability to select cancer related biomarkers which have been verified in biochemical or biomedical research and learn the biological significant correlation among genes.http://europepmc.org/articles/PMC3684607?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Meng-Yun Wu Dao-Qing Dai Xiao-Fei Zhang Yuan Zhu |
spellingShingle |
Meng-Yun Wu Dao-Qing Dai Xiao-Fei Zhang Yuan Zhu Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm. PLoS ONE |
author_facet |
Meng-Yun Wu Dao-Qing Dai Xiao-Fei Zhang Yuan Zhu |
author_sort |
Meng-Yun Wu |
title |
Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm. |
title_short |
Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm. |
title_full |
Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm. |
title_fullStr |
Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm. |
title_full_unstemmed |
Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm. |
title_sort |
cancer subtype discovery and biomarker identification via a new robust network clustering algorithm. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2013-01-01 |
description |
In cancer biology, it is very important to understand the phenotypic changes of the patients and discover new cancer subtypes. Recently, microarray-based technologies have shed light on this problem based on gene expression profiles which may contain outliers due to either chemical or electrical reasons. These undiscovered subtypes may be heterogeneous with respect to underlying networks or pathways, and are related with only a few of interdependent biomarkers. This motivates a need for the robust gene expression-based methods capable of discovering such subtypes, elucidating the corresponding network structures and identifying cancer related biomarkers. This study proposes a penalized model-based Student's t clustering with unconstrained covariance (PMT-UC) to discover cancer subtypes with cluster-specific networks, taking gene dependencies into account and having robustness against outliers. Meanwhile, biomarker identification and network reconstruction are achieved by imposing an adaptive [Formula: see text] penalty on the means and the inverse scale matrices. The model is fitted via the expectation maximization algorithm utilizing the graphical lasso. Here, a network-based gene selection criterion that identifies biomarkers not as individual genes but as subnetworks is applied. This allows us to implicate low discriminative biomarkers which play a central role in the subnetwork by interconnecting many differentially expressed genes, or have cluster-specific underlying network structures. Experiment results on simulated datasets and one available cancer dataset attest to the effectiveness, robustness of PMT-UC in cancer subtype discovering. Moveover, PMT-UC has the ability to select cancer related biomarkers which have been verified in biochemical or biomedical research and learn the biological significant correlation among genes. |
url |
http://europepmc.org/articles/PMC3684607?pdf=render |
work_keys_str_mv |
AT mengyunwu cancersubtypediscoveryandbiomarkeridentificationviaanewrobustnetworkclusteringalgorithm AT daoqingdai cancersubtypediscoveryandbiomarkeridentificationviaanewrobustnetworkclusteringalgorithm AT xiaofeizhang cancersubtypediscoveryandbiomarkeridentificationviaanewrobustnetworkclusteringalgorithm AT yuanzhu cancersubtypediscoveryandbiomarkeridentificationviaanewrobustnetworkclusteringalgorithm |
_version_ |
1725881315092332544 |