Screening the genome for HCC-specific CpG methylation signatures as biomarkers for diagnosis and prognosis evaluation
Abstract Background Hepatocellular carcinoma (HCC) is one of the most common and invasive malignant tumors in the world. The change in DNA methylation is a key event in HCC. Methods Methylation datasets for HCC and 17 other types of cancer were downloaded from The Cancer Genome Atlas (TCGA). The CpG...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-06-01
|
Series: | BMC Medical Genomics |
Online Access: | https://doi.org/10.1186/s12920-021-01015-9 |
id |
doaj-8ec585956cf2467d90146954c1c13868 |
---|---|
record_format |
Article |
spelling |
doaj-8ec585956cf2467d90146954c1c138682021-06-20T11:08:23ZengBMCBMC Medical Genomics1755-87942021-06-0114111310.1186/s12920-021-01015-9Screening the genome for HCC-specific CpG methylation signatures as biomarkers for diagnosis and prognosis evaluationRui-kun Zhang0Jia-lin Liu1Health Science Center, Shenzhen UniversityDepartment of Hepatobiliary and Pancreatic Surgery, Shenzhen Traditional Chinese Medicine HospitalAbstract Background Hepatocellular carcinoma (HCC) is one of the most common and invasive malignant tumors in the world. The change in DNA methylation is a key event in HCC. Methods Methylation datasets for HCC and 17 other types of cancer were downloaded from The Cancer Genome Atlas (TCGA). The CpG sites with large differences in methylation between tumor tissues and paracancerous tissues were identified. We used the HCC methylation dataset downloaded from the TCGA as the training set and removed the overlapping sites among all cancer datasets to ensure that only CpG sites specific to HCC remained. Logistic regression analysis was performed to select specific biomarkers that can be used to diagnose HCC, and two datasets—GSE157341 and GSE54503—downloaded from GEO as validation sets were used to validate our model. We also used a Cox regression model to select CpG sites related to patient prognosis. Results We identified 6 HCC-specific methylated CpG sites as biomarkers for HCC diagnosis. In the training set, the area under the receiver operating characteristic (ROC) curve (AUC) for the model containing all these sites was 0.971. The AUCs were 0.8802 and 0.9711 for the two validation sets from the GEO database. In addition, 3 other CpG sites were analyzed and used to create a risk scoring model for patient prognosis and survival prediction. Conclusions Through the analysis of HCC methylation datasets from the TCGA and Gene Expression Omnibus (GEO) databases, potential biomarkers for HCC diagnosis and prognosis evaluation were ascertained.https://doi.org/10.1186/s12920-021-01015-9 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rui-kun Zhang Jia-lin Liu |
spellingShingle |
Rui-kun Zhang Jia-lin Liu Screening the genome for HCC-specific CpG methylation signatures as biomarkers for diagnosis and prognosis evaluation BMC Medical Genomics |
author_facet |
Rui-kun Zhang Jia-lin Liu |
author_sort |
Rui-kun Zhang |
title |
Screening the genome for HCC-specific CpG methylation signatures as biomarkers for diagnosis and prognosis evaluation |
title_short |
Screening the genome for HCC-specific CpG methylation signatures as biomarkers for diagnosis and prognosis evaluation |
title_full |
Screening the genome for HCC-specific CpG methylation signatures as biomarkers for diagnosis and prognosis evaluation |
title_fullStr |
Screening the genome for HCC-specific CpG methylation signatures as biomarkers for diagnosis and prognosis evaluation |
title_full_unstemmed |
Screening the genome for HCC-specific CpG methylation signatures as biomarkers for diagnosis and prognosis evaluation |
title_sort |
screening the genome for hcc-specific cpg methylation signatures as biomarkers for diagnosis and prognosis evaluation |
publisher |
BMC |
series |
BMC Medical Genomics |
issn |
1755-8794 |
publishDate |
2021-06-01 |
description |
Abstract Background Hepatocellular carcinoma (HCC) is one of the most common and invasive malignant tumors in the world. The change in DNA methylation is a key event in HCC. Methods Methylation datasets for HCC and 17 other types of cancer were downloaded from The Cancer Genome Atlas (TCGA). The CpG sites with large differences in methylation between tumor tissues and paracancerous tissues were identified. We used the HCC methylation dataset downloaded from the TCGA as the training set and removed the overlapping sites among all cancer datasets to ensure that only CpG sites specific to HCC remained. Logistic regression analysis was performed to select specific biomarkers that can be used to diagnose HCC, and two datasets—GSE157341 and GSE54503—downloaded from GEO as validation sets were used to validate our model. We also used a Cox regression model to select CpG sites related to patient prognosis. Results We identified 6 HCC-specific methylated CpG sites as biomarkers for HCC diagnosis. In the training set, the area under the receiver operating characteristic (ROC) curve (AUC) for the model containing all these sites was 0.971. The AUCs were 0.8802 and 0.9711 for the two validation sets from the GEO database. In addition, 3 other CpG sites were analyzed and used to create a risk scoring model for patient prognosis and survival prediction. Conclusions Through the analysis of HCC methylation datasets from the TCGA and Gene Expression Omnibus (GEO) databases, potential biomarkers for HCC diagnosis and prognosis evaluation were ascertained. |
url |
https://doi.org/10.1186/s12920-021-01015-9 |
work_keys_str_mv |
AT ruikunzhang screeningthegenomeforhccspecificcpgmethylationsignaturesasbiomarkersfordiagnosisandprognosisevaluation AT jialinliu screeningthegenomeforhccspecificcpgmethylationsignaturesasbiomarkersfordiagnosisandprognosisevaluation |
_version_ |
1721370463676923904 |