Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics

Advances in high-throughput genomic and proteomic technology have led to a growing interest in cancer biomarkers. These biomarkers can potentially improve the accuracy of cancer subtype prediction and subsequently, the success of therapy. However, identification of statistically and biologically rel...

Full description

Bibliographic Details
Main Author: Phan, John H.
Published: Georgia Institute of Technology 2010
Subjects:
Online Access:http://hdl.handle.net/1853/33855
id ndltd-GATECH-oai-smartech.gatech.edu-1853-33855
record_format oai_dc
spelling ndltd-GATECH-oai-smartech.gatech.edu-1853-338552013-01-07T20:35:45ZBiomarker discovery and clinical outcome prediction using knowledge based-bioinformaticsPhan, John H.Feature selectionGene expression profilingBiomarker identificationClinical outcome predictionKnowledge basedBioinformaticsBiochemical markersGene expressionAdvances in high-throughput genomic and proteomic technology have led to a growing interest in cancer biomarkers. These biomarkers can potentially improve the accuracy of cancer subtype prediction and subsequently, the success of therapy. However, identification of statistically and biologically relevant biomarkers from high-throughput data can be unreliable due to the nature of the data--e.g., high technical variability, small sample size, and high dimension size. Due to the lack of available training samples, data-driven machine learning methods are often insufficient without the support of knowledge-based algorithms. We research and investigate the benefits of using knowledge-based algorithms to solve clinical prediction problems. Because we are interested in identifying biomarkers that are also feasible in clinical prediction models, we focus on two analytical components: feature selection and predictive model selection. In addition to data variance, we must also consider the variance of analytical methods. There are many existing feature selection algorithms, each of which may produce different results. Moreover, it is not trivial to identify model parameters that maximize the sensitivity and specificity of clinical prediction. Thus, we introduce a method that uses independently validated biological knowledge to reduce the space of relevant feature selection algorithms and to improve the reliability of clinical predictors. Finally, we implement several functions of this knowledge-based method as a web-based, user-friendly, and standards-compatible software application.Georgia Institute of Technology2010-06-10T15:22:46Z2010-06-10T15:22:46Z2009-04-02Dissertationhttp://hdl.handle.net/1853/33855
collection NDLTD
sources NDLTD
topic Feature selection
Gene expression profiling
Biomarker identification
Clinical outcome prediction
Knowledge based
Bioinformatics
Biochemical markers
Gene expression
spellingShingle Feature selection
Gene expression profiling
Biomarker identification
Clinical outcome prediction
Knowledge based
Bioinformatics
Biochemical markers
Gene expression
Phan, John H.
Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics
description Advances in high-throughput genomic and proteomic technology have led to a growing interest in cancer biomarkers. These biomarkers can potentially improve the accuracy of cancer subtype prediction and subsequently, the success of therapy. However, identification of statistically and biologically relevant biomarkers from high-throughput data can be unreliable due to the nature of the data--e.g., high technical variability, small sample size, and high dimension size. Due to the lack of available training samples, data-driven machine learning methods are often insufficient without the support of knowledge-based algorithms. We research and investigate the benefits of using knowledge-based algorithms to solve clinical prediction problems. Because we are interested in identifying biomarkers that are also feasible in clinical prediction models, we focus on two analytical components: feature selection and predictive model selection. In addition to data variance, we must also consider the variance of analytical methods. There are many existing feature selection algorithms, each of which may produce different results. Moreover, it is not trivial to identify model parameters that maximize the sensitivity and specificity of clinical prediction. Thus, we introduce a method that uses independently validated biological knowledge to reduce the space of relevant feature selection algorithms and to improve the reliability of clinical predictors. Finally, we implement several functions of this knowledge-based method as a web-based, user-friendly, and standards-compatible software application.
author Phan, John H.
author_facet Phan, John H.
author_sort Phan, John H.
title Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics
title_short Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics
title_full Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics
title_fullStr Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics
title_full_unstemmed Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics
title_sort biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics
publisher Georgia Institute of Technology
publishDate 2010
url http://hdl.handle.net/1853/33855
work_keys_str_mv AT phanjohnh biomarkerdiscoveryandclinicaloutcomepredictionusingknowledgebasedbioinformatics
_version_ 1716475268436066304