Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset

<p>Abstract</p> <p>Background</p> <p>Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these stu...

Full description

Bibliographic Details
Main Authors: Al-Hasani Hadi, Beule Dieter, Dreja Tanja, Chadt Alexandra, Tiss Ali, Towers Mark W, Smith Celia J, Kleinjung Frank, Bauer Chris, Reinert Knut, Schuchhardt Johannes, Cramer Rainer
Format: Article
Language:English
Published: BMC 2011-05-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/140
id doaj-39027ace414d41d7a498a3012cec2f1a
record_format Article
spelling doaj-39027ace414d41d7a498a3012cec2f1a2020-11-25T00:44:40ZengBMCBMC Bioinformatics1471-21052011-05-0112114010.1186/1471-2105-12-140Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model DatasetAl-Hasani HadiBeule DieterDreja TanjaChadt AlexandraTiss AliTowers Mark WSmith Celia JKleinjung FrankBauer ChrisReinert KnutSchuchhardt JohannesCramer Rainer<p>Abstract</p> <p>Background</p> <p>Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics.</p> <p>Results</p> <p>We present a comprehensive work-flow tailored for analyzing complex data including data from multi-factorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods.</p> <p>Conclusions</p> <p>The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author.</p> http://www.biomedcentral.com/1471-2105/12/140
collection DOAJ
language English
format Article
sources DOAJ
author Al-Hasani Hadi
Beule Dieter
Dreja Tanja
Chadt Alexandra
Tiss Ali
Towers Mark W
Smith Celia J
Kleinjung Frank
Bauer Chris
Reinert Knut
Schuchhardt Johannes
Cramer Rainer
spellingShingle Al-Hasani Hadi
Beule Dieter
Dreja Tanja
Chadt Alexandra
Tiss Ali
Towers Mark W
Smith Celia J
Kleinjung Frank
Bauer Chris
Reinert Knut
Schuchhardt Johannes
Cramer Rainer
Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
BMC Bioinformatics
author_facet Al-Hasani Hadi
Beule Dieter
Dreja Tanja
Chadt Alexandra
Tiss Ali
Towers Mark W
Smith Celia J
Kleinjung Frank
Bauer Chris
Reinert Knut
Schuchhardt Johannes
Cramer Rainer
author_sort Al-Hasani Hadi
title Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_short Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_full Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_fullStr Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_full_unstemmed Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_sort biomarker discovery and redundancy reduction towards classification using a multi-factorial maldi-tof ms t2dm mouse model dataset
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-05-01
description <p>Abstract</p> <p>Background</p> <p>Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics.</p> <p>Results</p> <p>We present a comprehensive work-flow tailored for analyzing complex data including data from multi-factorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods.</p> <p>Conclusions</p> <p>The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author.</p>
url http://www.biomedcentral.com/1471-2105/12/140
work_keys_str_mv AT alhasanihadi biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT beuledieter biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT drejatanja biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT chadtalexandra biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT tissali biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT towersmarkw biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT smithceliaj biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT kleinjungfrank biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT bauerchris biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT reinertknut biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT schuchhardtjohannes biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT cramerrainer biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
_version_ 1725274227507986432