Assessment of intratumoral heterogeneity with mutations and gene expression profiles.
Intratumoral heterogeneity (ITH) refers to the presence of distinct tumor cell populations. It provides vital information for the clinical prognosis, drug responsiveness, and personalized treatment of cancer patients. As genomic ITH in various cancers affects the expression patterns of genes, the ex...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2019-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0219682 |
id |
doaj-cc5d24058bf74214a4b32719df9e9db0 |
---|---|
record_format |
Article |
spelling |
doaj-cc5d24058bf74214a4b32719df9e9db02021-03-03T21:12:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01147e021968210.1371/journal.pone.0219682Assessment of intratumoral heterogeneity with mutations and gene expression profiles.Ji-Yong SungHyun-Tae ShinKyung-Ah SohnSoo-Yong ShinWoong-Yang ParkJe-Gun JoungIntratumoral heterogeneity (ITH) refers to the presence of distinct tumor cell populations. It provides vital information for the clinical prognosis, drug responsiveness, and personalized treatment of cancer patients. As genomic ITH in various cancers affects the expression patterns of genes, the expression profile could be utilized for determining ITH level. Herein, we present a novel approach to directly detect high ITH defined as a larger number of subclones from the gene expression pattern through machine learning approaches. We examined associations between gene expression profile and ITH of 12 cancer types from The Cancer Genome Atlas (TCGA) database. Using stomach adenocarcinoma (STAD) showing high association, we evaluated the performance of our method in predicting ITH by employing three machine learning algorithms using gene expression profile data. We classified tumors into high and low heterogeneity groups using the learning model through the selection of LASSO feature. The result showed that support vector machines (SVMs) outperformed other algorithms (AUC = 0.84 in SVMs and 0.82 in Naïve Bayes) and we were able to improve predictive power by using both combined data from mutation and expression. Furthermore, we evaluated the prediction ability of each model using simulation data generated by mixing cell lines of the Cancer Cell Line Encyclopedia (CCLE), and obtained consistent results with using real dataset. Our approach could be utilized for discriminating tumors with heterogeneous cell populations to characterize ITH.https://doi.org/10.1371/journal.pone.0219682 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ji-Yong Sung Hyun-Tae Shin Kyung-Ah Sohn Soo-Yong Shin Woong-Yang Park Je-Gun Joung |
spellingShingle |
Ji-Yong Sung Hyun-Tae Shin Kyung-Ah Sohn Soo-Yong Shin Woong-Yang Park Je-Gun Joung Assessment of intratumoral heterogeneity with mutations and gene expression profiles. PLoS ONE |
author_facet |
Ji-Yong Sung Hyun-Tae Shin Kyung-Ah Sohn Soo-Yong Shin Woong-Yang Park Je-Gun Joung |
author_sort |
Ji-Yong Sung |
title |
Assessment of intratumoral heterogeneity with mutations and gene expression profiles. |
title_short |
Assessment of intratumoral heterogeneity with mutations and gene expression profiles. |
title_full |
Assessment of intratumoral heterogeneity with mutations and gene expression profiles. |
title_fullStr |
Assessment of intratumoral heterogeneity with mutations and gene expression profiles. |
title_full_unstemmed |
Assessment of intratumoral heterogeneity with mutations and gene expression profiles. |
title_sort |
assessment of intratumoral heterogeneity with mutations and gene expression profiles. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2019-01-01 |
description |
Intratumoral heterogeneity (ITH) refers to the presence of distinct tumor cell populations. It provides vital information for the clinical prognosis, drug responsiveness, and personalized treatment of cancer patients. As genomic ITH in various cancers affects the expression patterns of genes, the expression profile could be utilized for determining ITH level. Herein, we present a novel approach to directly detect high ITH defined as a larger number of subclones from the gene expression pattern through machine learning approaches. We examined associations between gene expression profile and ITH of 12 cancer types from The Cancer Genome Atlas (TCGA) database. Using stomach adenocarcinoma (STAD) showing high association, we evaluated the performance of our method in predicting ITH by employing three machine learning algorithms using gene expression profile data. We classified tumors into high and low heterogeneity groups using the learning model through the selection of LASSO feature. The result showed that support vector machines (SVMs) outperformed other algorithms (AUC = 0.84 in SVMs and 0.82 in Naïve Bayes) and we were able to improve predictive power by using both combined data from mutation and expression. Furthermore, we evaluated the prediction ability of each model using simulation data generated by mixing cell lines of the Cancer Cell Line Encyclopedia (CCLE), and obtained consistent results with using real dataset. Our approach could be utilized for discriminating tumors with heterogeneous cell populations to characterize ITH. |
url |
https://doi.org/10.1371/journal.pone.0219682 |
work_keys_str_mv |
AT jiyongsung assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles AT hyuntaeshin assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles AT kyungahsohn assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles AT sooyongshin assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles AT woongyangpark assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles AT jegunjoung assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles |
_version_ |
1714818237302046720 |