Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting Prognosis

Background: Lung adenocarcinoma (LUAD) is one of the main types of lung cancer. Because of its low early diagnosis rate, poor late prognosis, and high mortality, it is of great significance to find biomarkers for diagnosis and prognosis.Methods: Five hundred and twelve LUADs from The Cancer Genome A...

Full description

Bibliographic Details
Main Authors: Cheng Liu, Xiang Li, Hua Shao, Dan Li
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-12-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fgene.2020.565206/full
id doaj-73985f765d4b4593af4b9758fc70603e
record_format Article
spelling doaj-73985f765d4b4593af4b9758fc70603e2020-12-21T06:44:28ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-12-011110.3389/fgene.2020.565206565206Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting PrognosisCheng Liu0Xiang Li1Hua Shao2Dan Li3Department of Thoracic Surgery, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, ChinaDepartment of Neurology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, ChinaDepartment of Neurology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, ChinaDepartment of Neurology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, ChinaBackground: Lung adenocarcinoma (LUAD) is one of the main types of lung cancer. Because of its low early diagnosis rate, poor late prognosis, and high mortality, it is of great significance to find biomarkers for diagnosis and prognosis.Methods: Five hundred and twelve LUADs from The Cancer Genome Atlas were used for differential expression analysis and short time-series expression miner (STEM) analysis to identify the LUAD-development characteristic genes. Survival analysis was used to identify the LUAD-unfavorable genes and LUAD-favorable genes. Gene set variation analysis (GSVA) was used to score individual samples against the two gene sets. Receiver operating characteristic (ROC) curve analysis and univariate and multivariate Cox regression analysis were used to explore the diagnostic and prognostic ability of the two GSVA score systems. Two independent data sets from Gene Expression Omnibus (GEO) were used for verifying the results. Functional enrichment analysis was used to explore the potential biological functions of LUAD-unfavorable genes.Results: With the development of LUAD, 185 differentially expressed genes (DEGs) were gradually upregulated, of which 84 genes were associated with LUAD survival and named as LUAD-unfavorable gene set. While 237 DEGs were gradually downregulated, of which 39 genes were associated with LUAD survival and named as LUAD-favorable gene set. ROC curve analysis and univariate/multivariate Cox proportional hazards analyses indicated both of LUAD-unfavorable GSVA score and LUAD-favorable GSVA score were a biomarker of LUAD. Moreover, both of these two GSVA score systems were an independent factor for LUAD prognosis. The LUAD-unfavorable genes were significantly involved in p53 signaling pathway, Oocyte meiosis, and Cell cycle.Conclusion: We identified and validated two LUAD-development characteristic gene sets that not only have diagnostic value but also prognostic value. It may provide new insight for further research on LUAD.https://www.frontiersin.org/articles/10.3389/fgene.2020.565206/fulllung adenocarcinomaprognostic stratification systemThe Cancer Genome Atlasgene set variation analysis scorepredicting prognosis
collection DOAJ
language English
format Article
sources DOAJ
author Cheng Liu
Xiang Li
Hua Shao
Dan Li
spellingShingle Cheng Liu
Xiang Li
Hua Shao
Dan Li
Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting Prognosis
Frontiers in Genetics
lung adenocarcinoma
prognostic stratification system
The Cancer Genome Atlas
gene set variation analysis score
predicting prognosis
author_facet Cheng Liu
Xiang Li
Hua Shao
Dan Li
author_sort Cheng Liu
title Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting Prognosis
title_short Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting Prognosis
title_full Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting Prognosis
title_fullStr Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting Prognosis
title_full_unstemmed Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting Prognosis
title_sort identification and validation of two lung adenocarcinoma-development characteristic gene sets for diagnosing lung adenocarcinoma and predicting prognosis
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2020-12-01
description Background: Lung adenocarcinoma (LUAD) is one of the main types of lung cancer. Because of its low early diagnosis rate, poor late prognosis, and high mortality, it is of great significance to find biomarkers for diagnosis and prognosis.Methods: Five hundred and twelve LUADs from The Cancer Genome Atlas were used for differential expression analysis and short time-series expression miner (STEM) analysis to identify the LUAD-development characteristic genes. Survival analysis was used to identify the LUAD-unfavorable genes and LUAD-favorable genes. Gene set variation analysis (GSVA) was used to score individual samples against the two gene sets. Receiver operating characteristic (ROC) curve analysis and univariate and multivariate Cox regression analysis were used to explore the diagnostic and prognostic ability of the two GSVA score systems. Two independent data sets from Gene Expression Omnibus (GEO) were used for verifying the results. Functional enrichment analysis was used to explore the potential biological functions of LUAD-unfavorable genes.Results: With the development of LUAD, 185 differentially expressed genes (DEGs) were gradually upregulated, of which 84 genes were associated with LUAD survival and named as LUAD-unfavorable gene set. While 237 DEGs were gradually downregulated, of which 39 genes were associated with LUAD survival and named as LUAD-favorable gene set. ROC curve analysis and univariate/multivariate Cox proportional hazards analyses indicated both of LUAD-unfavorable GSVA score and LUAD-favorable GSVA score were a biomarker of LUAD. Moreover, both of these two GSVA score systems were an independent factor for LUAD prognosis. The LUAD-unfavorable genes were significantly involved in p53 signaling pathway, Oocyte meiosis, and Cell cycle.Conclusion: We identified and validated two LUAD-development characteristic gene sets that not only have diagnostic value but also prognostic value. It may provide new insight for further research on LUAD.
topic lung adenocarcinoma
prognostic stratification system
The Cancer Genome Atlas
gene set variation analysis score
predicting prognosis
url https://www.frontiersin.org/articles/10.3389/fgene.2020.565206/full
work_keys_str_mv AT chengliu identificationandvalidationoftwolungadenocarcinomadevelopmentcharacteristicgenesetsfordiagnosinglungadenocarcinomaandpredictingprognosis
AT xiangli identificationandvalidationoftwolungadenocarcinomadevelopmentcharacteristicgenesetsfordiagnosinglungadenocarcinomaandpredictingprognosis
AT huashao identificationandvalidationoftwolungadenocarcinomadevelopmentcharacteristicgenesetsfordiagnosinglungadenocarcinomaandpredictingprognosis
AT danli identificationandvalidationoftwolungadenocarcinomadevelopmentcharacteristicgenesetsfordiagnosinglungadenocarcinomaandpredictingprognosis
_version_ 1724375440206856192