Assessment of intratumoral heterogeneity with mutations and gene expression profiles.

Intratumoral heterogeneity (ITH) refers to the presence of distinct tumor cell populations. It provides vital information for the clinical prognosis, drug responsiveness, and personalized treatment of cancer patients. As genomic ITH in various cancers affects the expression patterns of genes, the ex...

Full description

Bibliographic Details
Main Authors: Ji-Yong Sung, Hyun-Tae Shin, Kyung-Ah Sohn, Soo-Yong Shin, Woong-Yang Park, Je-Gun Joung
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0219682
id doaj-cc5d24058bf74214a4b32719df9e9db0
record_format Article
spelling doaj-cc5d24058bf74214a4b32719df9e9db02021-03-03T21:12:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01147e021968210.1371/journal.pone.0219682Assessment of intratumoral heterogeneity with mutations and gene expression profiles.Ji-Yong SungHyun-Tae ShinKyung-Ah SohnSoo-Yong ShinWoong-Yang ParkJe-Gun JoungIntratumoral heterogeneity (ITH) refers to the presence of distinct tumor cell populations. It provides vital information for the clinical prognosis, drug responsiveness, and personalized treatment of cancer patients. As genomic ITH in various cancers affects the expression patterns of genes, the expression profile could be utilized for determining ITH level. Herein, we present a novel approach to directly detect high ITH defined as a larger number of subclones from the gene expression pattern through machine learning approaches. We examined associations between gene expression profile and ITH of 12 cancer types from The Cancer Genome Atlas (TCGA) database. Using stomach adenocarcinoma (STAD) showing high association, we evaluated the performance of our method in predicting ITH by employing three machine learning algorithms using gene expression profile data. We classified tumors into high and low heterogeneity groups using the learning model through the selection of LASSO feature. The result showed that support vector machines (SVMs) outperformed other algorithms (AUC = 0.84 in SVMs and 0.82 in Naïve Bayes) and we were able to improve predictive power by using both combined data from mutation and expression. Furthermore, we evaluated the prediction ability of each model using simulation data generated by mixing cell lines of the Cancer Cell Line Encyclopedia (CCLE), and obtained consistent results with using real dataset. Our approach could be utilized for discriminating tumors with heterogeneous cell populations to characterize ITH.https://doi.org/10.1371/journal.pone.0219682
collection DOAJ
language English
format Article
sources DOAJ
author Ji-Yong Sung
Hyun-Tae Shin
Kyung-Ah Sohn
Soo-Yong Shin
Woong-Yang Park
Je-Gun Joung
spellingShingle Ji-Yong Sung
Hyun-Tae Shin
Kyung-Ah Sohn
Soo-Yong Shin
Woong-Yang Park
Je-Gun Joung
Assessment of intratumoral heterogeneity with mutations and gene expression profiles.
PLoS ONE
author_facet Ji-Yong Sung
Hyun-Tae Shin
Kyung-Ah Sohn
Soo-Yong Shin
Woong-Yang Park
Je-Gun Joung
author_sort Ji-Yong Sung
title Assessment of intratumoral heterogeneity with mutations and gene expression profiles.
title_short Assessment of intratumoral heterogeneity with mutations and gene expression profiles.
title_full Assessment of intratumoral heterogeneity with mutations and gene expression profiles.
title_fullStr Assessment of intratumoral heterogeneity with mutations and gene expression profiles.
title_full_unstemmed Assessment of intratumoral heterogeneity with mutations and gene expression profiles.
title_sort assessment of intratumoral heterogeneity with mutations and gene expression profiles.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2019-01-01
description Intratumoral heterogeneity (ITH) refers to the presence of distinct tumor cell populations. It provides vital information for the clinical prognosis, drug responsiveness, and personalized treatment of cancer patients. As genomic ITH in various cancers affects the expression patterns of genes, the expression profile could be utilized for determining ITH level. Herein, we present a novel approach to directly detect high ITH defined as a larger number of subclones from the gene expression pattern through machine learning approaches. We examined associations between gene expression profile and ITH of 12 cancer types from The Cancer Genome Atlas (TCGA) database. Using stomach adenocarcinoma (STAD) showing high association, we evaluated the performance of our method in predicting ITH by employing three machine learning algorithms using gene expression profile data. We classified tumors into high and low heterogeneity groups using the learning model through the selection of LASSO feature. The result showed that support vector machines (SVMs) outperformed other algorithms (AUC = 0.84 in SVMs and 0.82 in Naïve Bayes) and we were able to improve predictive power by using both combined data from mutation and expression. Furthermore, we evaluated the prediction ability of each model using simulation data generated by mixing cell lines of the Cancer Cell Line Encyclopedia (CCLE), and obtained consistent results with using real dataset. Our approach could be utilized for discriminating tumors with heterogeneous cell populations to characterize ITH.
url https://doi.org/10.1371/journal.pone.0219682
work_keys_str_mv AT jiyongsung assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles
AT hyuntaeshin assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles
AT kyungahsohn assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles
AT sooyongshin assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles
AT woongyangpark assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles
AT jegunjoung assessmentofintratumoralheterogeneitywithmutationsandgeneexpressionprofiles
_version_ 1714818237302046720