The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.

Exact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to...

Full description

Bibliographic Details
Main Author: Oliver Serang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3953406?pdf=render
id doaj-693d0517fdfb436f9cb223d0bc83e506
record_format Article
spelling doaj-693d0517fdfb436f9cb223d0bc83e5062020-11-24T21:27:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0193e9150710.1371/journal.pone.0091507The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.Oliver SerangExact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to exploit a general form of this symmetry on probabilistic adder nodes by transforming those probabilistic adder nodes into a probabilistic convolution tree with which dynamic programming computes exact probabilities. A substantial speedup is demonstrated using an illustration example that can arise when identifying splice forms with bottom-up mass spectrometry-based proteomics. On this example, even state-of-the-art exact inference algorithms require a runtime more than exponential in the number of splice forms considered. By using the probabilistic convolution tree, we reduce the runtime to O(k log(k)2) and the space to O(k log(k)) where k is the number of variables joined by an additive or cardinal operator. This approach, which can also be used with junction tree inference, is applicable to graphs with arbitrary dependency on counting variables or cardinalities and can be used on diverse problems and fields like forward error correcting codes, elemental decomposition, and spectral demixing. The approach also trivially generalizes to multiple dimensions.http://europepmc.org/articles/PMC3953406?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Oliver Serang
spellingShingle Oliver Serang
The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
PLoS ONE
author_facet Oliver Serang
author_sort Oliver Serang
title The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_short The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_full The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_fullStr The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_full_unstemmed The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_sort probabilistic convolution tree: efficient exact bayesian inference for faster lc-ms/ms protein inference.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description Exact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to exploit a general form of this symmetry on probabilistic adder nodes by transforming those probabilistic adder nodes into a probabilistic convolution tree with which dynamic programming computes exact probabilities. A substantial speedup is demonstrated using an illustration example that can arise when identifying splice forms with bottom-up mass spectrometry-based proteomics. On this example, even state-of-the-art exact inference algorithms require a runtime more than exponential in the number of splice forms considered. By using the probabilistic convolution tree, we reduce the runtime to O(k log(k)2) and the space to O(k log(k)) where k is the number of variables joined by an additive or cardinal operator. This approach, which can also be used with junction tree inference, is applicable to graphs with arbitrary dependency on counting variables or cardinalities and can be used on diverse problems and fields like forward error correcting codes, elemental decomposition, and spectral demixing. The approach also trivially generalizes to multiple dimensions.
url http://europepmc.org/articles/PMC3953406?pdf=render
work_keys_str_mv AT oliverserang theprobabilisticconvolutiontreeefficientexactbayesianinferenceforfasterlcmsmsproteininference
AT oliverserang probabilisticconvolutiontreeefficientexactbayesianinferenceforfasterlcmsmsproteininference
_version_ 1725976058811908096