Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction

Abstract Background Current efforts in Metabolomics, such as the Human Metabolome Project, collect structures of biological metabolites as well as data for their characterisation, such as spectra for identification of substances and measurements of thei...

Full description

Bibliographic Details
Main Authors:	Neumann Steffen, Egert Björn, Kuhn Stefan, Steinbeck Christoph
Format:	Article
Language:	English
Published:	BMC 2008-09-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/9/400

id	doaj-ddfd1883f64846eeafd014a7807d0f63
record_format	Article
spelling	doaj-ddfd1883f64846eeafd014a7807d0f632020-11-25T00:27:52ZengBMCBMC Bioinformatics1471-21052008-09-019140010.1186/1471-2105-9-400Building blocks for automated elucidation of metabolites: Machine learning methods for NMR predictionNeumann SteffenEgert BjörnKuhn StefanSteinbeck Christoph<p>Abstract</p> <p>Background</p> <p>Current efforts in Metabolomics, such as the Human Metabolome Project, collect structures of biological metabolites as well as data for their characterisation, such as spectra for identification of substances and measurements of their concentration. Still, only a fraction of existing metabolites and their spectral fingerprints are known. Computer-Assisted Structure Elucidation (CASE) of biological metabolites will be an important tool to leverage this lack of knowledge. Indispensable for CASE are modules to predict spectra for hypothetical structures. This paper evaluates different statistical and machine learning methods to perform predictions of proton NMR spectra based on data from our open database NMRShiftDB.</p> <p>Results</p> <p>A mean absolute error of 0.18 ppm was achieved for the prediction of proton NMR shifts ranging from 0 to 11 ppm. Random forest, J48 decision tree and support vector machines achieved similar overall errors. HOSE codes being a notably simple method achieved a comparatively good result of 0.17 ppm mean absolute error.</p> <p>Conclusion</p> <p>NMR prediction methods applied in the course of this work delivered precise predictions which can serve as a building block for Computer-Assisted Structure Elucidation for biological metabolites.</p> http://www.biomedcentral.com/1471-2105/9/400
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Neumann Steffen Egert Björn Kuhn Stefan Steinbeck Christoph
spellingShingle	Neumann Steffen Egert Björn Kuhn Stefan Steinbeck Christoph Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction BMC Bioinformatics
author_facet	Neumann Steffen Egert Björn Kuhn Stefan Steinbeck Christoph
author_sort	Neumann Steffen
title	Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction
title_short	Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction
title_full	Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction
title_fullStr	Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction
title_full_unstemmed	Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction
title_sort	building blocks for automated elucidation of metabolites: machine learning methods for nmr prediction
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2008-09-01
description	<p>Abstract</p> <p>Background</p> <p>Current efforts in Metabolomics, such as the Human Metabolome Project, collect structures of biological metabolites as well as data for their characterisation, such as spectra for identification of substances and measurements of their concentration. Still, only a fraction of existing metabolites and their spectral fingerprints are known. Computer-Assisted Structure Elucidation (CASE) of biological metabolites will be an important tool to leverage this lack of knowledge. Indispensable for CASE are modules to predict spectra for hypothetical structures. This paper evaluates different statistical and machine learning methods to perform predictions of proton NMR spectra based on data from our open database NMRShiftDB.</p> <p>Results</p> <p>A mean absolute error of 0.18 ppm was achieved for the prediction of proton NMR shifts ranging from 0 to 11 ppm. Random forest, J48 decision tree and support vector machines achieved similar overall errors. HOSE codes being a notably simple method achieved a comparatively good result of 0.17 ppm mean absolute error.</p> <p>Conclusion</p> <p>NMR prediction methods applied in the course of this work delivered precise predictions which can serve as a building block for Computer-Assisted Structure Elucidation for biological metabolites.</p>
url	http://www.biomedcentral.com/1471-2105/9/400
work_keys_str_mv	AT neumannsteffen buildingblocksforautomatedelucidationofmetabolitesmachinelearningmethodsfornmrprediction AT egertbjorn buildingblocksforautomatedelucidationofmetabolitesmachinelearningmethodsfornmrprediction AT kuhnstefan buildingblocksforautomatedelucidationofmetabolitesmachinelearningmethodsfornmrprediction AT steinbeckchristoph buildingblocksforautomatedelucidationofmetabolitesmachinelearningmethodsfornmrprediction
_version_	1725338050207154176

Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction

Similar Items