Association analysis using somatic mutations.
Somatic mutations drive the growth of tumor cells and are pivotal biomarkers for many cancer treatments. Genetic association analysis using somatic mutations is an effective approach to study the functional impact of somatic mutations. However, standard regression methods are not appropriate for som...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2018-11-01
|
Series: | PLoS Genetics |
Online Access: | http://europepmc.org/articles/PMC6235399?pdf=render |
id |
doaj-81cae873968a4790a8f0fd6719185350 |
---|---|
record_format |
Article |
spelling |
doaj-81cae873968a4790a8f0fd67191853502020-11-25T02:11:48ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042018-11-011411e100774610.1371/journal.pgen.1007746Association analysis using somatic mutations.Yang LiuQianchuan HeWei SunSomatic mutations drive the growth of tumor cells and are pivotal biomarkers for many cancer treatments. Genetic association analysis using somatic mutations is an effective approach to study the functional impact of somatic mutations. However, standard regression methods are not appropriate for somatic mutation association studies because somatic mutation calls often have non-ignorable false positive rate and/or false negative rate. While large scale association analysis using somatic mutations becomes feasible recently-thanks for the improvement of sequencing techniques and the reduction of sequencing cost-there is an urgent need for a new statistical method designed for somatic mutation association analysis. We propose such a method with computationally efficient software implementation: Somatic mutation Association test with Measurement Errors (SAME). SAME accounts for somatic mutation calling uncertainty using a likelihood based approach. It can be used to assess the associations between continuous/dichotomous outcomes and individual mutations or gene-level mutations. Through simulation studies across a wide range of realistic scenarios, we show that SAME can significantly improve statistical power than the naive generalized linear model that ignores mutation calling uncertainty. Finally, using the data collected from The Cancer Genome Atlas (TCGA) project, we apply SAME to study the associations between somatic mutations and gene expression in 12 cancer types, as well as the associations between somatic mutations and colon cancer subtype defined by DNA methylation data. SAME recovered some interesting findings that were missed by the generalized linear model. In addition, we demonstrated that mutation-level and gene-level analyses are often more appropriate for oncogene and tumor-suppressor gene, respectively.http://europepmc.org/articles/PMC6235399?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yang Liu Qianchuan He Wei Sun |
spellingShingle |
Yang Liu Qianchuan He Wei Sun Association analysis using somatic mutations. PLoS Genetics |
author_facet |
Yang Liu Qianchuan He Wei Sun |
author_sort |
Yang Liu |
title |
Association analysis using somatic mutations. |
title_short |
Association analysis using somatic mutations. |
title_full |
Association analysis using somatic mutations. |
title_fullStr |
Association analysis using somatic mutations. |
title_full_unstemmed |
Association analysis using somatic mutations. |
title_sort |
association analysis using somatic mutations. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Genetics |
issn |
1553-7390 1553-7404 |
publishDate |
2018-11-01 |
description |
Somatic mutations drive the growth of tumor cells and are pivotal biomarkers for many cancer treatments. Genetic association analysis using somatic mutations is an effective approach to study the functional impact of somatic mutations. However, standard regression methods are not appropriate for somatic mutation association studies because somatic mutation calls often have non-ignorable false positive rate and/or false negative rate. While large scale association analysis using somatic mutations becomes feasible recently-thanks for the improvement of sequencing techniques and the reduction of sequencing cost-there is an urgent need for a new statistical method designed for somatic mutation association analysis. We propose such a method with computationally efficient software implementation: Somatic mutation Association test with Measurement Errors (SAME). SAME accounts for somatic mutation calling uncertainty using a likelihood based approach. It can be used to assess the associations between continuous/dichotomous outcomes and individual mutations or gene-level mutations. Through simulation studies across a wide range of realistic scenarios, we show that SAME can significantly improve statistical power than the naive generalized linear model that ignores mutation calling uncertainty. Finally, using the data collected from The Cancer Genome Atlas (TCGA) project, we apply SAME to study the associations between somatic mutations and gene expression in 12 cancer types, as well as the associations between somatic mutations and colon cancer subtype defined by DNA methylation data. SAME recovered some interesting findings that were missed by the generalized linear model. In addition, we demonstrated that mutation-level and gene-level analyses are often more appropriate for oncogene and tumor-suppressor gene, respectively. |
url |
http://europepmc.org/articles/PMC6235399?pdf=render |
work_keys_str_mv |
AT yangliu associationanalysisusingsomaticmutations AT qianchuanhe associationanalysisusingsomaticmutations AT weisun associationanalysisusingsomaticmutations |
_version_ |
1724912507323154432 |