TCox: Correlation-Based Regularization Applied to Colorectal Cancer Survival Data

Colorectal cancer (CRC) is one of the leading causes of mortality and morbidity in the world. Being a heterogeneous disease, cancer therapy and prognosis represent a significant challenge to medical care. The molecular information improves the accuracy with which patients are classified and treated...


Bibliographic Details
Main Authors: Carolina Peixoto, Marta B. Lopes, Marta Martins, Luís Costa, Susana Vinga
Format: Article
Language: English
Published: MDPI AG 2020-11-01
Series: Biomedicines
Subjects: regularized optimization; Cox regression; survival analysis; TCGA data; RNA-seq data
Online Access: https://www.mdpi.com/2227-9059/8/11/488
id doaj-2960784dd2494d8da26f9dc5353d940f
record_format Article
spelling doaj-2960784dd2494d8da26f9dc5353d940f
2020-11-25T04:09:51Z
eng
MDPI AG
Biomedicines
2227-9059
2020-11-01
8
488
488
10.3390/biomedicines8110488
TCox: Correlation-Based Regularization Applied to Colorectal Cancer Survival Data
Carolina Peixoto (0)
Marta B. Lopes (1)
Marta Martins (2)
Luís Costa (3)
Susana Vinga (4)
(0) INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Rua Alves Redol 9, 1000-029 Lisboa, Portugal
(1) NOVA Laboratory for Computer Science and Informatics (NOVA LINCS), FCT, UNL, 2829-516 Caparica, Portugal
(2) Instituto de Medicina Molecular-João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Avenida Professor Egas Moniz, 1649-028 Lisboa, Portugal
(3) Instituto de Medicina Molecular-João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Avenida Professor Egas Moniz, 1649-028 Lisboa, Portugal
(4) INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Rua Alves Redol 9, 1000-029 Lisboa, Portugal
Colorectal cancer (CRC) is one of the leading causes of mortality and morbidity in the world. Being a heterogeneous disease, cancer therapy and prognosis represent a significant challenge to medical care. The molecular information improves the accuracy with which patients are classified and treated since similar pathologies may show different clinical outcomes and other responses to treatment. However, the high dimensionality of gene expression data makes the selection of novel genes a problematic task. We propose TCox, a novel penalization function for Cox models, which promotes the selection of genes that have distinct correlation patterns in normal vs. tumor tissues. We compare TCox to other regularized survival models, Elastic Net, HubCox, and OrphanCox. Gene expression and clinical data of CRC and normal (TCGA) patients are used for model evaluation. Each model is tested 100 times. Within a specific run, eighteen of the features selected by TCox are also selected by the other survival regression models tested, therefore undoubtedly being crucial players in the survival of colorectal cancer patients. Moreover, the TCox model exclusively selects genes able to categorize patients into significant risk groups. Our work demonstrates the ability of the proposed weighted regularizer TCox to disclose novel molecular drivers in CRC survival by accounting for correlation-based network information from both tumor and normal tissue. The results presented support the relevance of network information for biomarker identification in high-dimensional gene expression data and foster new directions for the development of network-based feature selection methods in precision oncology.
https://www.mdpi.com/2227-9059/8/11/488
regularized optimization
Cox regression
survival analysis
TCGA data
RNA-seq data
collection DOAJ
language English
format Article
sources DOAJ
author Carolina Peixoto
Marta B. Lopes
Marta Martins
Luís Costa
Susana Vinga
title TCox: Correlation-Based Regularization Applied to Colorectal Cancer Survival Data
publisher MDPI AG
series Biomedicines
issn 2227-9059
publishDate 2020-11-01
description Colorectal cancer (CRC) is one of the leading causes of mortality and morbidity in the world. Being a heterogeneous disease, cancer therapy and prognosis represent a significant challenge to medical care. The molecular information improves the accuracy with which patients are classified and treated since similar pathologies may show different clinical outcomes and other responses to treatment. However, the high dimensionality of gene expression data makes the selection of novel genes a problematic task. We propose TCox, a novel penalization function for Cox models, which promotes the selection of genes that have distinct correlation patterns in normal vs. tumor tissues. We compare TCox to other regularized survival models, Elastic Net, HubCox, and OrphanCox. Gene expression and clinical data of CRC and normal (TCGA) patients are used for model evaluation. Each model is tested 100 times. Within a specific run, eighteen of the features selected by TCox are also selected by the other survival regression models tested, therefore undoubtedly being crucial players in the survival of colorectal cancer patients. Moreover, the TCox model exclusively selects genes able to categorize patients into significant risk groups. Our work demonstrates the ability of the proposed weighted regularizer TCox to disclose novel molecular drivers in CRC survival by accounting for correlation-based network information from both tumor and normal tissue. The results presented support the relevance of network information for biomarker identification in high-dimensional gene expression data and foster new directions for the development of network-based feature selection methods in precision oncology.
topic regularized optimization
Cox regression
survival analysis
TCGA data
RNA-seq data
url https://www.mdpi.com/2227-9059/8/11/488