Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis
Abstract Background Blood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable. However, accuracy and specificity need to be improved, and methods for the integration of gene expression with epigenetic data need to be developed in order to make this feasibl...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2019-02-01
|
Series: | Clinical Epigenetics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s13148-019-0621-5 |
id |
doaj-a40a10a5aa204d2180fd20f52ce958d9 |
---|---|
record_format |
Article |
spelling |
doaj-a40a10a5aa204d2180fd20f52ce958d92020-11-24T21:25:54ZengBMCClinical Epigenetics1868-70751868-70832019-02-0111111510.1186/s13148-019-0621-5Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysisChangliang Wang0Liang Chen1Yang Yang2Menglei Zhang3Garry Wong4Cancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauCancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauCancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauCancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauCancer Centre, Centre of Reproduction, Development and Aging, Faculty of Health Sciences, University of MacauAbstract Background Blood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable. However, accuracy and specificity need to be improved, and methods for the integration of gene expression with epigenetic data need to be developed in order to make this feasible. Methods Whole blood gene expression data and DNA methylation data were downloaded from Gene Expression Omnibus (GEO) database. A linear model was used to identify significantly differentially expressed genes (DEGs) and differentially methylated genes (DMGs) according to specific gene regions 5′—C—phosphate—G—3′ (CpGs) or all gene regions CpGs in PD. Gene set enrichment analysis was then applied to DEGs and DMGs. Subsequently, data integration analysis was performed to identify robust PD-associated blood biomarkers. Finally, the random forest algorithm and a leave-one-out cross validation method were performed to construct classifiers based on gene expression data integrated with methylation data. Results Eighty-five (85) significantly hypo-methylated and upregulated genes in PD patients compared to healthy controls were identified. The dominant hypo-methylated regions of these genes were significantly different. Some genes had a single dominant hypo-methylated region, while others had multiple dominant hypo-methylated regions. One gene expression classifier and two gene methylation classifiers based on all or dominant methylation-altered region CpGs were constructed. All have a good prediction power for PD. Conclusions Gene expression and methylation data integration analysis identified a blood-based 53-gene signature, which could be applied as a biomarker for PD.http://link.springer.com/article/10.1186/s13148-019-0621-5Parkinson’s diseaseData integrationDNA methylationGene expression |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Changliang Wang Liang Chen Yang Yang Menglei Zhang Garry Wong |
spellingShingle |
Changliang Wang Liang Chen Yang Yang Menglei Zhang Garry Wong Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis Clinical Epigenetics Parkinson’s disease Data integration DNA methylation Gene expression |
author_facet |
Changliang Wang Liang Chen Yang Yang Menglei Zhang Garry Wong |
author_sort |
Changliang Wang |
title |
Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis |
title_short |
Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis |
title_full |
Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis |
title_fullStr |
Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis |
title_full_unstemmed |
Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis |
title_sort |
identification of potential blood biomarkers for parkinson’s disease by gene expression and dna methylation data integration analysis |
publisher |
BMC |
series |
Clinical Epigenetics |
issn |
1868-7075 1868-7083 |
publishDate |
2019-02-01 |
description |
Abstract Background Blood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable. However, accuracy and specificity need to be improved, and methods for the integration of gene expression with epigenetic data need to be developed in order to make this feasible. Methods Whole blood gene expression data and DNA methylation data were downloaded from Gene Expression Omnibus (GEO) database. A linear model was used to identify significantly differentially expressed genes (DEGs) and differentially methylated genes (DMGs) according to specific gene regions 5′—C—phosphate—G—3′ (CpGs) or all gene regions CpGs in PD. Gene set enrichment analysis was then applied to DEGs and DMGs. Subsequently, data integration analysis was performed to identify robust PD-associated blood biomarkers. Finally, the random forest algorithm and a leave-one-out cross validation method were performed to construct classifiers based on gene expression data integrated with methylation data. Results Eighty-five (85) significantly hypo-methylated and upregulated genes in PD patients compared to healthy controls were identified. The dominant hypo-methylated regions of these genes were significantly different. Some genes had a single dominant hypo-methylated region, while others had multiple dominant hypo-methylated regions. One gene expression classifier and two gene methylation classifiers based on all or dominant methylation-altered region CpGs were constructed. All have a good prediction power for PD. Conclusions Gene expression and methylation data integration analysis identified a blood-based 53-gene signature, which could be applied as a biomarker for PD. |
topic |
Parkinson’s disease Data integration DNA methylation Gene expression |
url |
http://link.springer.com/article/10.1186/s13148-019-0621-5 |
work_keys_str_mv |
AT changliangwang identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis AT liangchen identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis AT yangyang identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis AT mengleizhang identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis AT garrywong identificationofpotentialbloodbiomarkersforparkinsonsdiseasebygeneexpressionanddnamethylationdataintegrationanalysis |
_version_ |
1725982008005361664 |