Association analysis using somatic mutations.

Somatic mutations drive the growth of tumor cells and are pivotal biomarkers for many cancer treatments. Genetic association analysis using somatic mutations is an effective approach to study the functional impact of somatic mutations. However, standard regression methods are not appropriate for som...

Full description

Bibliographic Details
Main Authors: Yang Liu, Qianchuan He, Wei Sun
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-11-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC6235399?pdf=render
id doaj-81cae873968a4790a8f0fd6719185350
record_format Article
spelling doaj-81cae873968a4790a8f0fd67191853502020-11-25T02:11:48ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042018-11-011411e100774610.1371/journal.pgen.1007746Association analysis using somatic mutations.Yang LiuQianchuan HeWei SunSomatic mutations drive the growth of tumor cells and are pivotal biomarkers for many cancer treatments. Genetic association analysis using somatic mutations is an effective approach to study the functional impact of somatic mutations. However, standard regression methods are not appropriate for somatic mutation association studies because somatic mutation calls often have non-ignorable false positive rate and/or false negative rate. While large scale association analysis using somatic mutations becomes feasible recently-thanks for the improvement of sequencing techniques and the reduction of sequencing cost-there is an urgent need for a new statistical method designed for somatic mutation association analysis. We propose such a method with computationally efficient software implementation: Somatic mutation Association test with Measurement Errors (SAME). SAME accounts for somatic mutation calling uncertainty using a likelihood based approach. It can be used to assess the associations between continuous/dichotomous outcomes and individual mutations or gene-level mutations. Through simulation studies across a wide range of realistic scenarios, we show that SAME can significantly improve statistical power than the naive generalized linear model that ignores mutation calling uncertainty. Finally, using the data collected from The Cancer Genome Atlas (TCGA) project, we apply SAME to study the associations between somatic mutations and gene expression in 12 cancer types, as well as the associations between somatic mutations and colon cancer subtype defined by DNA methylation data. SAME recovered some interesting findings that were missed by the generalized linear model. In addition, we demonstrated that mutation-level and gene-level analyses are often more appropriate for oncogene and tumor-suppressor gene, respectively.http://europepmc.org/articles/PMC6235399?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Yang Liu
Qianchuan He
Wei Sun
spellingShingle Yang Liu
Qianchuan He
Wei Sun
Association analysis using somatic mutations.
PLoS Genetics
author_facet Yang Liu
Qianchuan He
Wei Sun
author_sort Yang Liu
title Association analysis using somatic mutations.
title_short Association analysis using somatic mutations.
title_full Association analysis using somatic mutations.
title_fullStr Association analysis using somatic mutations.
title_full_unstemmed Association analysis using somatic mutations.
title_sort association analysis using somatic mutations.
publisher Public Library of Science (PLoS)
series PLoS Genetics
issn 1553-7390
1553-7404
publishDate 2018-11-01
description Somatic mutations drive the growth of tumor cells and are pivotal biomarkers for many cancer treatments. Genetic association analysis using somatic mutations is an effective approach to study the functional impact of somatic mutations. However, standard regression methods are not appropriate for somatic mutation association studies because somatic mutation calls often have non-ignorable false positive rate and/or false negative rate. While large scale association analysis using somatic mutations becomes feasible recently-thanks for the improvement of sequencing techniques and the reduction of sequencing cost-there is an urgent need for a new statistical method designed for somatic mutation association analysis. We propose such a method with computationally efficient software implementation: Somatic mutation Association test with Measurement Errors (SAME). SAME accounts for somatic mutation calling uncertainty using a likelihood based approach. It can be used to assess the associations between continuous/dichotomous outcomes and individual mutations or gene-level mutations. Through simulation studies across a wide range of realistic scenarios, we show that SAME can significantly improve statistical power than the naive generalized linear model that ignores mutation calling uncertainty. Finally, using the data collected from The Cancer Genome Atlas (TCGA) project, we apply SAME to study the associations between somatic mutations and gene expression in 12 cancer types, as well as the associations between somatic mutations and colon cancer subtype defined by DNA methylation data. SAME recovered some interesting findings that were missed by the generalized linear model. In addition, we demonstrated that mutation-level and gene-level analyses are often more appropriate for oncogene and tumor-suppressor gene, respectively.
url http://europepmc.org/articles/PMC6235399?pdf=render
work_keys_str_mv AT yangliu associationanalysisusingsomaticmutations
AT qianchuanhe associationanalysisusingsomaticmutations
AT weisun associationanalysisusingsomaticmutations
_version_ 1724912507323154432