A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>H NMR metabonomic data

<p>Abstract</p> <p>Background</p> <p>A key challenge in metabonomics is to uncover quantitative associations between multidimensional spectroscopic data and biochemical measures used for disease risk assessment and diagnostics. Here we focus on clinically relevant estim...

Full description

Bibliographic Details
Main Authors: Kaski Kimmo, Hannuksela Minna L, Savolainen Markku J, Mäkelä Sanna M, Ingman Petri, Soininen Pasi, Mäkinen Ville-Petteri, Vehtari Aki, Ala-Korpela Mika
Format: Article
Language:English
Published: BMC 2007-05-01
Series:BMC Bioinformatics
id doaj-0bf76042db444c42b345dfb4070a2698
record_format Article
spelling doaj-0bf76042db444c42b345dfb4070a26982020-11-25T01:34:27ZengBMCBMC Bioinformatics1471-21052007-05-018Suppl 2S810.1186/1471-2105-8-S2-S8A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>H NMR metabonomic dataKaski KimmoHannuksela Minna LSavolainen Markku JMäkelä Sanna MIngman PetriSoininen PasiMäkinen Ville-PetteriVehtari AkiAla-Korpela Mika<p>Abstract</p> <p>Background</p> <p>A key challenge in metabonomics is to uncover quantitative associations between multidimensional spectroscopic data and biochemical measures used for disease risk assessment and diagnostics. Here we focus on clinically relevant estimation of lipoprotein lipids by <sup>1</sup>H NMR spectroscopy of serum.</p> <p>Results</p> <p>A Bayesian methodology, with a biochemical motivation, is presented for a real <sup>1</sup>H NMR metabonomics data set of 75 serum samples. Lipoprotein lipid concentrations were independently obtained for these samples via ultracentrifugation and specific biochemical assays. The Bayesian models were constructed by Markov chain Monte Carlo (MCMC) and they showed remarkably good quantitative performance, the predictive R-values being 0.985 for the very low density lipoprotein triglycerides (VLDL-TG), 0.787 for the intermediate, 0.943 for the low, and 0.933 for the high density lipoprotein cholesterol (IDL-C, LDL-C and HDL-C, respectively). The modelling produced a kernel-based reformulation of the data, the parameters of which coincided with the well-known biochemical characteristics of the <sup>1</sup>H NMR spectra; particularly for VLDL-TG and HDL-C the Bayesian methodology was able to clearly identify the most characteristic resonances within the heavily overlapping information in the spectra. For IDL-C and LDL-C the resulting model kernels were more complex than those for VLDL-TG and HDL-C, probably reflecting the severe overlap of the IDL and LDL resonances in the <sup>1</sup>H NMR spectra.</p> <p>Conclusion</p> <p>The systematic use of Bayesian MCMC analysis is computationally demanding. Nevertheless, the combination of high-quality quantification and the biochemical rationale of the resulting models is expected to be useful in the field of metabonomics.</p>
collection DOAJ
language English
format Article
sources DOAJ
author Kaski Kimmo
Hannuksela Minna L
Savolainen Markku J
Mäkelä Sanna M
Ingman Petri
Soininen Pasi
Mäkinen Ville-Petteri
Vehtari Aki
Ala-Korpela Mika
spellingShingle Kaski Kimmo
Hannuksela Minna L
Savolainen Markku J
Mäkelä Sanna M
Ingman Petri
Soininen Pasi
Mäkinen Ville-Petteri
Vehtari Aki
Ala-Korpela Mika
A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>H NMR metabonomic data
BMC Bioinformatics
author_facet Kaski Kimmo
Hannuksela Minna L
Savolainen Markku J
Mäkelä Sanna M
Ingman Petri
Soininen Pasi
Mäkinen Ville-Petteri
Vehtari Aki
Ala-Korpela Mika
author_sort Kaski Kimmo
title A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>H NMR metabonomic data
title_short A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>H NMR metabonomic data
title_full A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>H NMR metabonomic data
title_fullStr A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>H NMR metabonomic data
title_full_unstemmed A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>H NMR metabonomic data
title_sort novel bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in <sup>1</sup>h nmr metabonomic data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2007-05-01
description <p>Abstract</p> <p>Background</p> <p>A key challenge in metabonomics is to uncover quantitative associations between multidimensional spectroscopic data and biochemical measures used for disease risk assessment and diagnostics. Here we focus on clinically relevant estimation of lipoprotein lipids by <sup>1</sup>H NMR spectroscopy of serum.</p> <p>Results</p> <p>A Bayesian methodology, with a biochemical motivation, is presented for a real <sup>1</sup>H NMR metabonomics data set of 75 serum samples. Lipoprotein lipid concentrations were independently obtained for these samples via ultracentrifugation and specific biochemical assays. The Bayesian models were constructed by Markov chain Monte Carlo (MCMC) and they showed remarkably good quantitative performance, the predictive R-values being 0.985 for the very low density lipoprotein triglycerides (VLDL-TG), 0.787 for the intermediate, 0.943 for the low, and 0.933 for the high density lipoprotein cholesterol (IDL-C, LDL-C and HDL-C, respectively). The modelling produced a kernel-based reformulation of the data, the parameters of which coincided with the well-known biochemical characteristics of the <sup>1</sup>H NMR spectra; particularly for VLDL-TG and HDL-C the Bayesian methodology was able to clearly identify the most characteristic resonances within the heavily overlapping information in the spectra. For IDL-C and LDL-C the resulting model kernels were more complex than those for VLDL-TG and HDL-C, probably reflecting the severe overlap of the IDL and LDL resonances in the <sup>1</sup>H NMR spectra.</p> <p>Conclusion</p> <p>The systematic use of Bayesian MCMC analysis is computationally demanding. Nevertheless, the combination of high-quality quantification and the biochemical rationale of the resulting models is expected to be useful in the field of metabonomics.</p>
work_keys_str_mv AT kaskikimmo anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT hannukselaminnal anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT savolainenmarkkuj anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT makelasannam anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT ingmanpetri anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT soininenpasi anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT makinenvillepetteri anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT vehtariaki anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT alakorpelamika anovelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT kaskikimmo novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT hannukselaminnal novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT savolainenmarkkuj novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT makelasannam novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT ingmanpetri novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT soininenpasi novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT makinenvillepetteri novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT vehtariaki novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
AT alakorpelamika novelbayesianapproachtoquantifyclinicalvariablesandtodeterminetheirspectroscopiccounterpartsinsup1suphnmrmetabonomicdata
_version_ 1725072010205200384