Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis

Abstract Background Blood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable. However, accuracy and specificity need to be improved, and methods for the integration of gene expression with epigenetic data need to be developed in order to make this feasibl...

Full description

Bibliographic Details
Main Authors: Changliang Wang, Liang Chen, Yang Yang, Menglei Zhang, Garry Wong
Format: Article
Language:English
Published: BMC 2019-02-01
Series:Clinical Epigenetics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13148-019-0621-5
id doaj-a40a10a5aa204d2180fd20f52ce958d9
record_format Article
spelling doaj-a40a10a5aa204d2180fd20f52ce958d92020-11-24T21:25:54ZengBMCClinical Epigenetics1868-70751868-70832019-02-0111111510.1186/s13148-019-0621-5Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysisChangliang Wang0Liang Chen1Yang Yang2Menglei Zhang3Garry Wong4Cancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauCancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauCancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauCancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauCancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauAbstract Background Blood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable. However, accuracy and specificity need to be improved, and methods for the integration of gene expression with epigenetic data need to be developed in order to make this feasible. Methods Whole blood gene expression data and DNA methylation data were downloaded from Gene Expression Omnibus (GEO) database. A linear model was used to identify significantly differentially expressed genes (DEGs) and differentially methylated genes (DMGs) according to specific gene regions 5′—C—phosphate—G—3′ (CpGs) or all gene regions CpGs in PD. Gene set enrichment analysis was then applied to DEGs and DMGs. Subsequently, data integration analysis was performed to identify robust PD-associated blood biomarkers. Finally, the random forest algorithm and a leave-one-out cross validation method were performed to construct classifiers based on gene expression data integrated with methylation data. Results Eighty-five (85) significantly hypo-methylated and upregulated genes in PD patients compared to healthy controls were identified. The dominant hypo-methylated regions of these genes were significantly different. Some genes had a single dominant hypo-methylated region, while others had multiple dominant hypo-methylated regions. One gene expression classifier and two gene methylation classifiers based on all or dominant methylation-altered region CpGs were constructed. All have a good prediction power for PD. Conclusions Gene expression and methylation data integration analysis identified a blood-based 53-gene signature, which could be applied as a biomarker for PD.http://link.springer.com/article/10.1186/s13148-019-0621-5Parkinson’s diseaseData integrationDNA methylationGene expression
collection DOAJ
language English
format Article
sources DOAJ
author Changliang Wang
Liang Chen
Yang Yang
Menglei Zhang
Garry Wong
spellingShingle Changliang Wang
Liang Chen
Yang Yang
Menglei Zhang
Garry Wong
Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis
Clinical Epigenetics
Parkinson’s disease
Data integration
DNA methylation
Gene expression
author_facet Changliang Wang
Liang Chen
Yang Yang
Menglei Zhang
Garry Wong
author_sort Changliang Wang
title Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis
title_short Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis
title_full Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis
title_fullStr Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis
title_full_unstemmed Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis
title_sort identification of potential blood biomarkers for parkinson’s disease by gene expression and dna methylation data integration analysis
publisher BMC
series Clinical Epigenetics
issn 1868-7075
1868-7083
publishDate 2019-02-01
description Abstract Background Blood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable. However, accuracy and specificity need to be improved, and methods for the integration of gene expression with epigenetic data need to be developed in order to make this feasible. Methods Whole blood gene expression data and DNA methylation data were downloaded from Gene Expression Omnibus (GEO) database. A linear model was used to identify significantly differentially expressed genes (DEGs) and differentially methylated genes (DMGs) according to specific gene regions 5′—C—phosphate—G—3′ (CpGs) or all gene regions CpGs in PD. Gene set enrichment analysis was then applied to DEGs and DMGs. Subsequently, data integration analysis was performed to identify robust PD-associated blood biomarkers. Finally, the random forest algorithm and a leave-one-out cross validation method were performed to construct classifiers based on gene expression data integrated with methylation data. Results Eighty-five (85) significantly hypo-methylated and upregulated genes in PD patients compared to healthy controls were identified. The dominant hypo-methylated regions of these genes were significantly different. Some genes had a single dominant hypo-methylated region, while others had multiple dominant hypo-methylated regions. One gene expression classifier and two gene methylation classifiers based on all or dominant methylation-altered region CpGs were constructed. All have a good prediction power for PD. Conclusions Gene expression and methylation data integration analysis identified a blood-based 53-gene signature, which could be applied as a biomarker for PD.
topic Parkinson’s disease
Data integration
DNA methylation
Gene expression
url http://link.springer.com/article/10.1186/s13148-019-0621-5
work_keys_str_mv AT changliangwang identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis
AT liangchen identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis
AT yangyang identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis
AT mengleizhang identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis
AT garrywong identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis
_version_ 1725982008005361664