A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers

DNA methylation is a widely investigated epigenetic mark that plays a vital role in tumorigenesis. Advancements in high-throughput assays, such as the Infinium 450K platform, provide genome-scale DNA methylation landscapes in single-CpG locus resolution, and the identification of differentially meth...

Full description

Bibliographic Details
Main Authors: Qi Tian, Jianxiao Zou, Yuan Fang, Zhongli Yu, Jianxiong Tang, Ying Song, Shicai Fan
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-09-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2019.00774/full
id doaj-f55340ec598a4ecb8bf29fb3f431b9bd
record_format Article
spelling doaj-f55340ec598a4ecb8bf29fb3f431b9bd2020-11-24T21:36:00ZengFrontiers Media S.A.Frontiers in Genetics1664-80212019-09-011010.3389/fgene.2019.00774471426A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-CancersQi Tian0Jianxiao Zou1Yuan Fang2Zhongli Yu3Jianxiong Tang4Ying Song5Shicai Fan6Shicai Fan7School of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaCenter for Informational Biology, University of Electronic Science and Technology of China, Chengdu, ChinaDNA methylation is a widely investigated epigenetic mark that plays a vital role in tumorigenesis. Advancements in high-throughput assays, such as the Infinium 450K platform, provide genome-scale DNA methylation landscapes in single-CpG locus resolution, and the identification of differentially methylated loci has become an insightful approach to deepen our understanding of cancers. However, the situation with extremely unbalanced numbers of samples and loci (approximately 1:1,000) makes it rather difficult to explore differential methylation between the sick and the normal. In this article, a hybrid approach based on ensemble feature selection for identifying differentially methylated loci (HyDML) was proposed by incorporating instance perturbation and multiple function models. Experiments on data from The Cancer Genome Atlas showed that HyDML not only achieved effective DML identification, but also outperformed the single-feature selection approach in terms of classification performance and the robustness of feature selection. The intensive analysis of the DML indicated that different types of cancers have mutual patterns, and the stable DML sharing in pan-cancers is of the great potential to be biomarkers, which may strengthen the confidence of domain experts to implement biological validations.https://www.frontiersin.org/article/10.3389/fgene.2019.00774/fullDNA methylationdifferentially methylated lociensemble feature selectionrobustnesspan-cancers
collection DOAJ
language English
format Article
sources DOAJ
author Qi Tian
Jianxiao Zou
Yuan Fang
Zhongli Yu
Jianxiong Tang
Ying Song
Shicai Fan
Shicai Fan
spellingShingle Qi Tian
Jianxiao Zou
Yuan Fang
Zhongli Yu
Jianxiong Tang
Ying Song
Shicai Fan
Shicai Fan
A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers
Frontiers in Genetics
DNA methylation
differentially methylated loci
ensemble feature selection
robustness
pan-cancers
author_facet Qi Tian
Jianxiao Zou
Yuan Fang
Zhongli Yu
Jianxiong Tang
Ying Song
Shicai Fan
Shicai Fan
author_sort Qi Tian
title A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers
title_short A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers
title_full A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers
title_fullStr A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers
title_full_unstemmed A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers
title_sort hybrid ensemble approach for identifying robust differentially methylated loci in pan-cancers
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2019-09-01
description DNA methylation is a widely investigated epigenetic mark that plays a vital role in tumorigenesis. Advancements in high-throughput assays, such as the Infinium 450K platform, provide genome-scale DNA methylation landscapes in single-CpG locus resolution, and the identification of differentially methylated loci has become an insightful approach to deepen our understanding of cancers. However, the situation with extremely unbalanced numbers of samples and loci (approximately 1:1,000) makes it rather difficult to explore differential methylation between the sick and the normal. In this article, a hybrid approach based on ensemble feature selection for identifying differentially methylated loci (HyDML) was proposed by incorporating instance perturbation and multiple function models. Experiments on data from The Cancer Genome Atlas showed that HyDML not only achieved effective DML identification, but also outperformed the single-feature selection approach in terms of classification performance and the robustness of feature selection. The intensive analysis of the DML indicated that different types of cancers have mutual patterns, and the stable DML sharing in pan-cancers is of the great potential to be biomarkers, which may strengthen the confidence of domain experts to implement biological validations.
topic DNA methylation
differentially methylated loci
ensemble feature selection
robustness
pan-cancers
url https://www.frontiersin.org/article/10.3389/fgene.2019.00774/full
work_keys_str_mv AT qitian ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT jianxiaozou ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT yuanfang ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT zhongliyu ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT jianxiongtang ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT yingsong ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT shicaifan ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT shicaifan ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT qitian hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT jianxiaozou hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT yuanfang hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT zhongliyu hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT jianxiongtang hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT yingsong hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT shicaifan hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
AT shicaifan hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers
_version_ 1725942794085728256