The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
Exact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2014-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3953406?pdf=render |
id |
doaj-693d0517fdfb436f9cb223d0bc83e506 |
---|---|
record_format |
Article |
spelling |
doaj-693d0517fdfb436f9cb223d0bc83e5062020-11-24T21:27:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0193e9150710.1371/journal.pone.0091507The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.Oliver SerangExact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to exploit a general form of this symmetry on probabilistic adder nodes by transforming those probabilistic adder nodes into a probabilistic convolution tree with which dynamic programming computes exact probabilities. A substantial speedup is demonstrated using an illustration example that can arise when identifying splice forms with bottom-up mass spectrometry-based proteomics. On this example, even state-of-the-art exact inference algorithms require a runtime more than exponential in the number of splice forms considered. By using the probabilistic convolution tree, we reduce the runtime to O(k log(k)2) and the space to O(k log(k)) where k is the number of variables joined by an additive or cardinal operator. This approach, which can also be used with junction tree inference, is applicable to graphs with arbitrary dependency on counting variables or cardinalities and can be used on diverse problems and fields like forward error correcting codes, elemental decomposition, and spectral demixing. The approach also trivially generalizes to multiple dimensions.http://europepmc.org/articles/PMC3953406?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Oliver Serang |
spellingShingle |
Oliver Serang The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference. PLoS ONE |
author_facet |
Oliver Serang |
author_sort |
Oliver Serang |
title |
The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference. |
title_short |
The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference. |
title_full |
The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference. |
title_fullStr |
The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference. |
title_full_unstemmed |
The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference. |
title_sort |
probabilistic convolution tree: efficient exact bayesian inference for faster lc-ms/ms protein inference. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2014-01-01 |
description |
Exact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to exploit a general form of this symmetry on probabilistic adder nodes by transforming those probabilistic adder nodes into a probabilistic convolution tree with which dynamic programming computes exact probabilities. A substantial speedup is demonstrated using an illustration example that can arise when identifying splice forms with bottom-up mass spectrometry-based proteomics. On this example, even state-of-the-art exact inference algorithms require a runtime more than exponential in the number of splice forms considered. By using the probabilistic convolution tree, we reduce the runtime to O(k log(k)2) and the space to O(k log(k)) where k is the number of variables joined by an additive or cardinal operator. This approach, which can also be used with junction tree inference, is applicable to graphs with arbitrary dependency on counting variables or cardinalities and can be used on diverse problems and fields like forward error correcting codes, elemental decomposition, and spectral demixing. The approach also trivially generalizes to multiple dimensions. |
url |
http://europepmc.org/articles/PMC3953406?pdf=render |
work_keys_str_mv |
AT oliverserang theprobabilisticconvolutiontreeefficientexactbayesianinferenceforfasterlcmsmsproteininference AT oliverserang probabilisticconvolutiontreeefficientexactbayesianinferenceforfasterlcmsmsproteininference |
_version_ |
1725976058811908096 |