The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.

Exact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to...

Full description

Bibliographic Details
Main Author:	Oliver Serang
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2014-01-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC3953406?pdf=render

id	doaj-693d0517fdfb436f9cb223d0bc83e506
record_format	Article
spelling	doaj-693d0517fdfb436f9cb223d0bc83e5062020-11-24T21:27:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0193e9150710.1371/journal.pone.0091507The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.Oliver SerangExact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to exploit a general form of this symmetry on probabilistic adder nodes by transforming those probabilistic adder nodes into a probabilistic convolution tree with which dynamic programming computes exact probabilities. A substantial speedup is demonstrated using an illustration example that can arise when identifying splice forms with bottom-up mass spectrometry-based proteomics. On this example, even state-of-the-art exact inference algorithms require a runtime more than exponential in the number of splice forms considered. By using the probabilistic convolution tree, we reduce the runtime to O(k log(k)2) and the space to O(k log(k)) where k is the number of variables joined by an additive or cardinal operator. This approach, which can also be used with junction tree inference, is applicable to graphs with arbitrary dependency on counting variables or cardinalities and can be used on diverse problems and fields like forward error correcting codes, elemental decomposition, and spectral demixing. The approach also trivially generalizes to multiple dimensions.http://europepmc.org/articles/PMC3953406?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Oliver Serang
spellingShingle	Oliver Serang The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference. PLoS ONE
author_facet	Oliver Serang
author_sort	Oliver Serang
title	The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_short	The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_full	The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_fullStr	The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_full_unstemmed	The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.
title_sort	probabilistic convolution tree: efficient exact bayesian inference for faster lc-ms/ms protein inference.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2014-01-01
description	Exact Bayesian inference can sometimes be performed efficiently for special cases where a function has commutative and associative symmetry of its inputs (called "causal independence"). For this reason, it is desirable to exploit such symmetry on big data sets. Here we present a method to exploit a general form of this symmetry on probabilistic adder nodes by transforming those probabilistic adder nodes into a probabilistic convolution tree with which dynamic programming computes exact probabilities. A substantial speedup is demonstrated using an illustration example that can arise when identifying splice forms with bottom-up mass spectrometry-based proteomics. On this example, even state-of-the-art exact inference algorithms require a runtime more than exponential in the number of splice forms considered. By using the probabilistic convolution tree, we reduce the runtime to O(k log(k)2) and the space to O(k log(k)) where k is the number of variables joined by an additive or cardinal operator. This approach, which can also be used with junction tree inference, is applicable to graphs with arbitrary dependency on counting variables or cardinalities and can be used on diverse problems and fields like forward error correcting codes, elemental decomposition, and spectral demixing. The approach also trivially generalizes to multiple dimensions.
url	http://europepmc.org/articles/PMC3953406?pdf=render
work_keys_str_mv	AT oliverserang theprobabilisticconvolutiontreeefficientexactbayesianinferenceforfasterlcmsmsproteininference AT oliverserang probabilisticconvolutiontreeefficientexactbayesianinferenceforfasterlcmsmsproteininference
_version_	1725976058811908096

The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.

Similar Items