An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data

Abstract It is urgent to diagnose colorectal cancer in the early stage. Some feature genes which are important to colorectal cancer development have been identified. However, for the early stage of colorectal cancer, less is known about the identity of specific cancer genes that are associated with...

Full description

Bibliographic Details
Main Authors: Jian Liu, Yuhu Cheng, Xuesong Wang, Lin Zhang, Hui Liu
Format: Article
Language:English
Published: Nature Publishing Group 2017-08-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-017-08881-3
id doaj-7e4716a6321f4f68bc13925dc131810e
record_format Article
spelling doaj-7e4716a6321f4f68bc13925dc131810e2020-12-08T01:43:31ZengNature Publishing GroupScientific Reports2045-23222017-08-017111210.1038/s41598-017-08881-3An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated DataJian Liu0Yuhu Cheng1Xuesong Wang2Lin Zhang3Hui Liu4School of Information and Control Engineering, China University of Mining and TechnologySchool of Information and Control Engineering, China University of Mining and TechnologySchool of Information and Control Engineering, China University of Mining and TechnologySchool of Information and Control Engineering, China University of Mining and TechnologySchool of Information and Control Engineering, China University of Mining and TechnologyAbstract It is urgent to diagnose colorectal cancer in the early stage. Some feature genes which are important to colorectal cancer development have been identified. However, for the early stage of colorectal cancer, less is known about the identity of specific cancer genes that are associated with advanced clinical stage. In this paper, we conducted a feature extraction method named Optimal Mean based Block Robust Feature Extraction method (OMBRFE) to identify feature genes associated with advanced colorectal cancer in clinical stage by using the integrated colorectal cancer data. Firstly, based on the optimal mean and L 2,1-norm, a novel feature extraction method called Optimal Mean based Robust Feature Extraction method (OMRFE) is proposed to identify feature genes. Then the OMBRFE method which introduces the block ideology into OMRFE method is put forward to process the colorectal cancer integrated data which includes multiple genomic data: copy number alterations, somatic mutations, methylation expression alteration, as well as gene expression changes. Experimental results demonstrate that the OMBRFE is more effective than previous methods in identifying the feature genes. Moreover, genes identified by OMBRFE are verified to be closely associated with advanced colorectal cancer in clinical stage.https://doi.org/10.1038/s41598-017-08881-3
collection DOAJ
language English
format Article
sources DOAJ
author Jian Liu
Yuhu Cheng
Xuesong Wang
Lin Zhang
Hui Liu
spellingShingle Jian Liu
Yuhu Cheng
Xuesong Wang
Lin Zhang
Hui Liu
An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data
Scientific Reports
author_facet Jian Liu
Yuhu Cheng
Xuesong Wang
Lin Zhang
Hui Liu
author_sort Jian Liu
title An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data
title_short An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data
title_full An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data
title_fullStr An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data
title_full_unstemmed An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data
title_sort optimal mean based block robust feature extraction method to identify colorectal cancer genes with integrated data
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2017-08-01
description Abstract It is urgent to diagnose colorectal cancer in the early stage. Some feature genes which are important to colorectal cancer development have been identified. However, for the early stage of colorectal cancer, less is known about the identity of specific cancer genes that are associated with advanced clinical stage. In this paper, we conducted a feature extraction method named Optimal Mean based Block Robust Feature Extraction method (OMBRFE) to identify feature genes associated with advanced colorectal cancer in clinical stage by using the integrated colorectal cancer data. Firstly, based on the optimal mean and L 2,1-norm, a novel feature extraction method called Optimal Mean based Robust Feature Extraction method (OMRFE) is proposed to identify feature genes. Then the OMBRFE method which introduces the block ideology into OMRFE method is put forward to process the colorectal cancer integrated data which includes multiple genomic data: copy number alterations, somatic mutations, methylation expression alteration, as well as gene expression changes. Experimental results demonstrate that the OMBRFE is more effective than previous methods in identifying the feature genes. Moreover, genes identified by OMBRFE are verified to be closely associated with advanced colorectal cancer in clinical stage.
url https://doi.org/10.1038/s41598-017-08881-3
work_keys_str_mv AT jianliu anoptimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT yuhucheng anoptimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT xuesongwang anoptimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT linzhang anoptimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT huiliu anoptimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT jianliu optimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT yuhucheng optimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT xuesongwang optimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT linzhang optimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
AT huiliu optimalmeanbasedblockrobustfeatureextractionmethodtoidentifycolorectalcancergeneswithintegrateddata
_version_ 1724394560694517760