Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics
Advances in high-throughput genomic and proteomic technology have led to a growing interest in cancer biomarkers. These biomarkers can potentially improve the accuracy of cancer subtype prediction and subsequently, the success of therapy. However, identification of statistically and biologically rel...
Main Author: | |
---|---|
Published: |
Georgia Institute of Technology
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/1853/33855 |
id |
ndltd-GATECH-oai-smartech.gatech.edu-1853-33855 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-GATECH-oai-smartech.gatech.edu-1853-338552013-01-07T20:35:45ZBiomarker discovery and clinical outcome prediction using knowledge based-bioinformaticsPhan, John H.Feature selectionGene expression profilingBiomarker identificationClinical outcome predictionKnowledge basedBioinformaticsBiochemical markersGene expressionAdvances in high-throughput genomic and proteomic technology have led to a growing interest in cancer biomarkers. These biomarkers can potentially improve the accuracy of cancer subtype prediction and subsequently, the success of therapy. However, identification of statistically and biologically relevant biomarkers from high-throughput data can be unreliable due to the nature of the data--e.g., high technical variability, small sample size, and high dimension size. Due to the lack of available training samples, data-driven machine learning methods are often insufficient without the support of knowledge-based algorithms. We research and investigate the benefits of using knowledge-based algorithms to solve clinical prediction problems. Because we are interested in identifying biomarkers that are also feasible in clinical prediction models, we focus on two analytical components: feature selection and predictive model selection. In addition to data variance, we must also consider the variance of analytical methods. There are many existing feature selection algorithms, each of which may produce different results. Moreover, it is not trivial to identify model parameters that maximize the sensitivity and specificity of clinical prediction. Thus, we introduce a method that uses independently validated biological knowledge to reduce the space of relevant feature selection algorithms and to improve the reliability of clinical predictors. Finally, we implement several functions of this knowledge-based method as a web-based, user-friendly, and standards-compatible software application.Georgia Institute of Technology2010-06-10T15:22:46Z2010-06-10T15:22:46Z2009-04-02Dissertationhttp://hdl.handle.net/1853/33855 |
collection |
NDLTD |
sources |
NDLTD |
topic |
Feature selection Gene expression profiling Biomarker identification Clinical outcome prediction Knowledge based Bioinformatics Biochemical markers Gene expression |
spellingShingle |
Feature selection Gene expression profiling Biomarker identification Clinical outcome prediction Knowledge based Bioinformatics Biochemical markers Gene expression Phan, John H. Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics |
description |
Advances in high-throughput genomic and proteomic technology have led to a growing interest in cancer biomarkers. These biomarkers can potentially improve the accuracy of cancer subtype prediction and subsequently, the success of therapy. However, identification of statistically and biologically relevant biomarkers from high-throughput data can be unreliable due to the nature of the data--e.g., high technical variability, small sample size, and high dimension size. Due to the lack of available training samples, data-driven machine learning methods are often insufficient without the support of knowledge-based algorithms. We research and investigate the benefits of using knowledge-based algorithms to solve clinical prediction problems. Because we are interested in identifying biomarkers that are also feasible in clinical prediction models, we focus on two analytical components: feature selection and predictive model selection. In addition to data variance, we must also consider the variance of analytical methods. There are many existing feature selection algorithms, each of which may produce different results. Moreover, it is not trivial to identify model parameters that maximize the sensitivity and specificity of clinical prediction. Thus, we introduce a method that uses independently validated biological knowledge to reduce the space of relevant feature selection algorithms and to improve the reliability of clinical predictors. Finally, we implement several functions of this knowledge-based method as a web-based, user-friendly, and standards-compatible software application. |
author |
Phan, John H. |
author_facet |
Phan, John H. |
author_sort |
Phan, John H. |
title |
Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics |
title_short |
Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics |
title_full |
Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics |
title_fullStr |
Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics |
title_full_unstemmed |
Biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics |
title_sort |
biomarker discovery and clinical outcome prediction using knowledge based-bioinformatics |
publisher |
Georgia Institute of Technology |
publishDate |
2010 |
url |
http://hdl.handle.net/1853/33855 |
work_keys_str_mv |
AT phanjohnh biomarkerdiscoveryandclinicaloutcomepredictionusingknowledgebasedbioinformatics |
_version_ |
1716475268436066304 |