Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer

Introduction: Extensive research is directed to uncover new biomarkers capable to stratify breast cancer patients into clinically relevant cohorts. However, the overall performance ranking of such marker candidates compared to other genes is virtually absent. Here, we present the ranking of all surv...

Full description

Bibliographic Details
Main Author: Balázs Győrffy
Format: Article
Language:English
Published: Elsevier 2021-01-01
Series:Computational and Structural Biotechnology Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2001037021003044
id doaj-9d82ae31a4864cf6a8fa6a66f5d4fa57
record_format Article
spelling doaj-9d82ae31a4864cf6a8fa6a66f5d4fa572021-07-31T04:38:46ZengElsevierComputational and Structural Biotechnology Journal2001-03702021-01-011941014109Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancerBalázs Győrffy0Semmelweis University Dept. of Bioinformatics, Tűzoltó utca 7-9., 1094 Budapest, Hungary; TTK Momentum Cancer Biomarker Research Group, Institute of Enzymology, Magyar Tudósok körútja 2., 1117 Budapest, Hungary; Semmelweis University 2nd Dept. of Pediatrics, Tűzoltó utca 7-9., 1094 Budapest, Hungary; Address: Department of Bioinformatics, Semmelweis University, Tüzoltó u. 7-9, 1094 Budapest, Hungary.Introduction: Extensive research is directed to uncover new biomarkers capable to stratify breast cancer patients into clinically relevant cohorts. However, the overall performance ranking of such marker candidates compared to other genes is virtually absent. Here, we present the ranking of all survival related genes in chemotherapy treated basal and estrogen positive/HER2 negative breast cancer. Methods: We searched the GEO repository to uncover transcriptomic datasets with available follow-up and clinical data. After quality control and normalization, samples entered an integrated database. Molecular subtypes were designated using gene expression data. Relapse-free survival analysis was performed using Cox proportional hazards regression. False discovery rate was computed to combat multiple hypothesis testing. Kaplan-Meier plots were drawn to visualize the best performing genes. Results: The entire database includes 7,830 unique samples from 55 independent datasets. Of those with available relapse-free survival time, 3,382 samples were estrogen receptor-positive and 696 were basal. In chemotherapy treated ER positive/ERBB2 negative patients the significant prognostic biomarker genes achieved hazard rates between 1.76 and 3.33 with a p value below 5.8E−04. The significant prognostic genes in adjuvant chemotherapy treated basal breast cancer samples reached hazard rates between 1.88 and 3.61 with a p value below 7.2E−04. Our integrated platform was extended enabling the validation of future biomarker candidates. Conclusions: A reference ranking for all genes in two chemotherapy treated breast cancer cohorts is presented. The results help to neglect those with unlikely clinical significance and to focus future research on the most promising candidates.http://www.sciencedirect.com/science/article/pii/S2001037021003044SurvivalBreast cancerChemotherapyBiomarkersPrognosisKaplan-Meier plot
collection DOAJ
language English
format Article
sources DOAJ
author Balázs Győrffy
spellingShingle Balázs Győrffy
Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer
Computational and Structural Biotechnology Journal
Survival
Breast cancer
Chemotherapy
Biomarkers
Prognosis
Kaplan-Meier plot
author_facet Balázs Győrffy
author_sort Balázs Győrffy
title Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer
title_short Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer
title_full Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer
title_fullStr Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer
title_full_unstemmed Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer
title_sort survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer
publisher Elsevier
series Computational and Structural Biotechnology Journal
issn 2001-0370
publishDate 2021-01-01
description Introduction: Extensive research is directed to uncover new biomarkers capable to stratify breast cancer patients into clinically relevant cohorts. However, the overall performance ranking of such marker candidates compared to other genes is virtually absent. Here, we present the ranking of all survival related genes in chemotherapy treated basal and estrogen positive/HER2 negative breast cancer. Methods: We searched the GEO repository to uncover transcriptomic datasets with available follow-up and clinical data. After quality control and normalization, samples entered an integrated database. Molecular subtypes were designated using gene expression data. Relapse-free survival analysis was performed using Cox proportional hazards regression. False discovery rate was computed to combat multiple hypothesis testing. Kaplan-Meier plots were drawn to visualize the best performing genes. Results: The entire database includes 7,830 unique samples from 55 independent datasets. Of those with available relapse-free survival time, 3,382 samples were estrogen receptor-positive and 696 were basal. In chemotherapy treated ER positive/ERBB2 negative patients the significant prognostic biomarker genes achieved hazard rates between 1.76 and 3.33 with a p value below 5.8E−04. The significant prognostic genes in adjuvant chemotherapy treated basal breast cancer samples reached hazard rates between 1.88 and 3.61 with a p value below 7.2E−04. Our integrated platform was extended enabling the validation of future biomarker candidates. Conclusions: A reference ranking for all genes in two chemotherapy treated breast cancer cohorts is presented. The results help to neglect those with unlikely clinical significance and to focus future research on the most promising candidates.
topic Survival
Breast cancer
Chemotherapy
Biomarkers
Prognosis
Kaplan-Meier plot
url http://www.sciencedirect.com/science/article/pii/S2001037021003044
work_keys_str_mv AT balazsgyorffy survivalanalysisacrosstheentiretranscriptomeidentifiesbiomarkerswiththehighestprognosticpowerinbreastcancer
_version_ 1721247056266264576