Identification of Key Genes in Lung Adenocarcinoma and Establishment of Prognostic Mode

BackgroundThe development of human tumors is associated with the abnormal expression of various functional genes, and a massive tumor-based database needs to be deeply mined. Based on a multigene prediction model, access to urgent prognosis of patients has become possible.Materials and MethodsWe sel...

Full description

Bibliographic Details
Main Authors: Zhou Jiawei, Mu Min, Xing Yingru, Zhang Xin, Li Danting, Liu Yafeng, Xie Jun, Hu Wangfa, Zhang Lijun, Wu Jing, Hu Dong
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-10-01
Series:Frontiers in Molecular Biosciences
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmolb.2020.561456/full
id doaj-2f071f04c5ac4c3a8ce2047df7a74a01
record_format Article
spelling doaj-2f071f04c5ac4c3a8ce2047df7a74a012020-11-25T04:04:42ZengFrontiers Media S.A.Frontiers in Molecular Biosciences2296-889X2020-10-01710.3389/fmolb.2020.561456561456Identification of Key Genes in Lung Adenocarcinoma and Establishment of Prognostic ModeZhou Jiawei0Mu Min1Xing Yingru2Zhang Xin3Li Danting4Liu Yafeng5Xie Jun6Hu Wangfa7Zhang Lijun8Wu Jing9Wu Jing10Hu Dong11Hu Dong12School of Medicine, Anhui University of Science and Technology, Huainan, ChinaKey Laboratory of Industrial Dust Prevention and Control and Occupational Safety and Health, Ministry of Education, Anhui University of Science and Technology, Huainan, ChinaAffiliated Cancer Hospital, Anhui University of Science and Technology, Huainan, ChinaSchool of Medicine, Anhui University of Science and Technology, Huainan, ChinaSchool of Medicine, Anhui University of Science and Technology, Huainan, ChinaSchool of Medicine, Anhui University of Science and Technology, Huainan, ChinaAffiliated Cancer Hospital, Anhui University of Science and Technology, Huainan, ChinaAffiliated Cancer Hospital, Anhui University of Science and Technology, Huainan, ChinaSchool of Medicine, Anhui University of Science and Technology, Huainan, ChinaSchool of Medicine, Anhui University of Science and Technology, Huainan, ChinaKey Laboratory of Industrial Dust Prevention and Control and Occupational Safety and Health, Ministry of Education, Anhui University of Science and Technology, Huainan, ChinaSchool of Medicine, Anhui University of Science and Technology, Huainan, ChinaKey Laboratory of Industrial Dust Prevention and Control and Occupational Safety and Health, Ministry of Education, Anhui University of Science and Technology, Huainan, ChinaBackgroundThe development of human tumors is associated with the abnormal expression of various functional genes, and a massive tumor-based database needs to be deeply mined. Based on a multigene prediction model, access to urgent prognosis of patients has become possible.Materials and MethodsWe selected three RNA expression profiles (GSE32863, GSE10072, and GSE43458) from the lung adenocarcinoma (LUAD) database of the Gene Expression Omnibus (GEO) and analyzed the differentially expressed genes (DEGs) between tumor and normal tissue using GEO2R program. After that, we analyzed the transcriptome data of 479 LUAD samples (54 normal tissue samples and 425 cancer tissue samples) and their clinical follow-up data from the (TCGA) database. Kaplan–Meier (KM) curve and receiver operating characteristic (ROC) were used to assess the prediction model. Multivariate Cox analysis was used to identify independent predictors. TCGA pancreatic adenocarcinoma datasets were used to establish a nomogram model.ResultsWe found 98 significantly prognosis-related genes using KM and COX analysis, among which six genes were found to be the DEGs in GEO. Using multivariate analysis, it was found that a single gene could not be used as an independent predictor of prognosis. However, the risk score calculated by weighting these six genes could serve as an independent prognosis predictor. COX analysis performed with multiple covariates such as age, gender, tumor stage, and TNM typing showed that risk score could still be utilized as an independent risk factor for patient survival rate (p = 0.013) and had an applicable reliability (area under the curve, AUC = 0.665). By combining risk score and various clinical features, the nomogram model was constructed, which had been proven to have high consistency for the prediction of 3- and 5-year survival rate (concordance = 0.751) and high accuracy as tested by ROC (AUC = 0.71;AUC = 0.708).ConclusionWe proposed a method to predict the prognosis of LUAD by weighting multiple genes and constructed a nomogram model suitable for the prognostic evaluation of LUAD, which could provide a new tool for the identification of therapeutic targets and the efficacy evaluation of LUAD.https://www.frontiersin.org/articles/10.3389/fmolb.2020.561456/fulllung adenocarcinomabioinformaticsprognosispredictordata mining
collection DOAJ
language English
format Article
sources DOAJ
author Zhou Jiawei
Mu Min
Xing Yingru
Zhang Xin
Li Danting
Liu Yafeng
Xie Jun
Hu Wangfa
Zhang Lijun
Wu Jing
Wu Jing
Hu Dong
Hu Dong
spellingShingle Zhou Jiawei
Mu Min
Xing Yingru
Zhang Xin
Li Danting
Liu Yafeng
Xie Jun
Hu Wangfa
Zhang Lijun
Wu Jing
Wu Jing
Hu Dong
Hu Dong
Identification of Key Genes in Lung Adenocarcinoma and Establishment of Prognostic Mode
Frontiers in Molecular Biosciences
lung adenocarcinoma
bioinformatics
prognosis
predictor
data mining
author_facet Zhou Jiawei
Mu Min
Xing Yingru
Zhang Xin
Li Danting
Liu Yafeng
Xie Jun
Hu Wangfa
Zhang Lijun
Wu Jing
Wu Jing
Hu Dong
Hu Dong
author_sort Zhou Jiawei
title Identification of Key Genes in Lung Adenocarcinoma and Establishment of Prognostic Mode
title_short Identification of Key Genes in Lung Adenocarcinoma and Establishment of Prognostic Mode
title_full Identification of Key Genes in Lung Adenocarcinoma and Establishment of Prognostic Mode
title_fullStr Identification of Key Genes in Lung Adenocarcinoma and Establishment of Prognostic Mode
title_full_unstemmed Identification of Key Genes in Lung Adenocarcinoma and Establishment of Prognostic Mode
title_sort identification of key genes in lung adenocarcinoma and establishment of prognostic mode
publisher Frontiers Media S.A.
series Frontiers in Molecular Biosciences
issn 2296-889X
publishDate 2020-10-01
description BackgroundThe development of human tumors is associated with the abnormal expression of various functional genes, and a massive tumor-based database needs to be deeply mined. Based on a multigene prediction model, access to urgent prognosis of patients has become possible.Materials and MethodsWe selected three RNA expression profiles (GSE32863, GSE10072, and GSE43458) from the lung adenocarcinoma (LUAD) database of the Gene Expression Omnibus (GEO) and analyzed the differentially expressed genes (DEGs) between tumor and normal tissue using GEO2R program. After that, we analyzed the transcriptome data of 479 LUAD samples (54 normal tissue samples and 425 cancer tissue samples) and their clinical follow-up data from the (TCGA) database. Kaplan–Meier (KM) curve and receiver operating characteristic (ROC) were used to assess the prediction model. Multivariate Cox analysis was used to identify independent predictors. TCGA pancreatic adenocarcinoma datasets were used to establish a nomogram model.ResultsWe found 98 significantly prognosis-related genes using KM and COX analysis, among which six genes were found to be the DEGs in GEO. Using multivariate analysis, it was found that a single gene could not be used as an independent predictor of prognosis. However, the risk score calculated by weighting these six genes could serve as an independent prognosis predictor. COX analysis performed with multiple covariates such as age, gender, tumor stage, and TNM typing showed that risk score could still be utilized as an independent risk factor for patient survival rate (p = 0.013) and had an applicable reliability (area under the curve, AUC = 0.665). By combining risk score and various clinical features, the nomogram model was constructed, which had been proven to have high consistency for the prediction of 3- and 5-year survival rate (concordance = 0.751) and high accuracy as tested by ROC (AUC = 0.71;AUC = 0.708).ConclusionWe proposed a method to predict the prognosis of LUAD by weighting multiple genes and constructed a nomogram model suitable for the prognostic evaluation of LUAD, which could provide a new tool for the identification of therapeutic targets and the efficacy evaluation of LUAD.
topic lung adenocarcinoma
bioinformatics
prognosis
predictor
data mining
url https://www.frontiersin.org/articles/10.3389/fmolb.2020.561456/full
work_keys_str_mv AT zhoujiawei identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT mumin identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT xingyingru identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT zhangxin identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT lidanting identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT liuyafeng identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT xiejun identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT huwangfa identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT zhanglijun identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT wujing identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT wujing identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT hudong identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
AT hudong identificationofkeygenesinlungadenocarcinomaandestablishmentofprognosticmode
_version_ 1724435693973798912