A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers
DNA methylation is a widely investigated epigenetic mark that plays a vital role in tumorigenesis. Advancements in high-throughput assays, such as the Infinium 450K platform, provide genome-scale DNA methylation landscapes in single-CpG locus resolution, and the identification of differentially meth...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2019-09-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fgene.2019.00774/full |
id |
doaj-f55340ec598a4ecb8bf29fb3f431b9bd |
---|---|
record_format |
Article |
spelling |
doaj-f55340ec598a4ecb8bf29fb3f431b9bd2020-11-24T21:36:00ZengFrontiers Media S.A.Frontiers in Genetics1664-80212019-09-011010.3389/fgene.2019.00774471426A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-CancersQi Tian0Jianxiao Zou1Yuan Fang2Zhongli Yu3Jianxiong Tang4Ying Song5Shicai Fan6Shicai Fan7School of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaSchool of Automation Engineering, University of Electronic Science and Technology of ChinaCenter for Informational Biology, University of Electronic Science and Technology of China, Chengdu, ChinaDNA methylation is a widely investigated epigenetic mark that plays a vital role in tumorigenesis. Advancements in high-throughput assays, such as the Infinium 450K platform, provide genome-scale DNA methylation landscapes in single-CpG locus resolution, and the identification of differentially methylated loci has become an insightful approach to deepen our understanding of cancers. However, the situation with extremely unbalanced numbers of samples and loci (approximately 1:1,000) makes it rather difficult to explore differential methylation between the sick and the normal. In this article, a hybrid approach based on ensemble feature selection for identifying differentially methylated loci (HyDML) was proposed by incorporating instance perturbation and multiple function models. Experiments on data from The Cancer Genome Atlas showed that HyDML not only achieved effective DML identification, but also outperformed the single-feature selection approach in terms of classification performance and the robustness of feature selection. The intensive analysis of the DML indicated that different types of cancers have mutual patterns, and the stable DML sharing in pan-cancers is of the great potential to be biomarkers, which may strengthen the confidence of domain experts to implement biological validations.https://www.frontiersin.org/article/10.3389/fgene.2019.00774/fullDNA methylationdifferentially methylated lociensemble feature selectionrobustnesspan-cancers |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Qi Tian Jianxiao Zou Yuan Fang Zhongli Yu Jianxiong Tang Ying Song Shicai Fan Shicai Fan |
spellingShingle |
Qi Tian Jianxiao Zou Yuan Fang Zhongli Yu Jianxiong Tang Ying Song Shicai Fan Shicai Fan A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers Frontiers in Genetics DNA methylation differentially methylated loci ensemble feature selection robustness pan-cancers |
author_facet |
Qi Tian Jianxiao Zou Yuan Fang Zhongli Yu Jianxiong Tang Ying Song Shicai Fan Shicai Fan |
author_sort |
Qi Tian |
title |
A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers |
title_short |
A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers |
title_full |
A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers |
title_fullStr |
A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers |
title_full_unstemmed |
A Hybrid Ensemble Approach for Identifying Robust Differentially Methylated Loci in Pan-Cancers |
title_sort |
hybrid ensemble approach for identifying robust differentially methylated loci in pan-cancers |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Genetics |
issn |
1664-8021 |
publishDate |
2019-09-01 |
description |
DNA methylation is a widely investigated epigenetic mark that plays a vital role in tumorigenesis. Advancements in high-throughput assays, such as the Infinium 450K platform, provide genome-scale DNA methylation landscapes in single-CpG locus resolution, and the identification of differentially methylated loci has become an insightful approach to deepen our understanding of cancers. However, the situation with extremely unbalanced numbers of samples and loci (approximately 1:1,000) makes it rather difficult to explore differential methylation between the sick and the normal. In this article, a hybrid approach based on ensemble feature selection for identifying differentially methylated loci (HyDML) was proposed by incorporating instance perturbation and multiple function models. Experiments on data from The Cancer Genome Atlas showed that HyDML not only achieved effective DML identification, but also outperformed the single-feature selection approach in terms of classification performance and the robustness of feature selection. The intensive analysis of the DML indicated that different types of cancers have mutual patterns, and the stable DML sharing in pan-cancers is of the great potential to be biomarkers, which may strengthen the confidence of domain experts to implement biological validations. |
topic |
DNA methylation differentially methylated loci ensemble feature selection robustness pan-cancers |
url |
https://www.frontiersin.org/article/10.3389/fgene.2019.00774/full |
work_keys_str_mv |
AT qitian ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT jianxiaozou ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT yuanfang ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT zhongliyu ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT jianxiongtang ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT yingsong ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT shicaifan ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT shicaifan ahybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT qitian hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT jianxiaozou hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT yuanfang hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT zhongliyu hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT jianxiongtang hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT yingsong hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT shicaifan hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers AT shicaifan hybridensembleapproachforidentifyingrobustdifferentiallymethylatedlociinpancancers |
_version_ |
1725942794085728256 |