The application of the permutation test on genome wide expression analysis

We are now in a new era. The recent completion of the entire sequence of the human genome and high-throughput gene expression technologies has transformed the era of molecular biology to the era of genomics. Already, such technologies are showing great promise in disease classification and gene targ...

Full description

Bibliographic Details
Main Author: Chan, Timothy
Language:English
Published: 2010
Online Access:http://hdl.handle.net/2429/17660
id ndltd-UBC-oai-circle.library.ubc.ca-2429-17660
record_format oai_dc
spelling ndltd-UBC-oai-circle.library.ubc.ca-2429-176602018-01-05T17:39:00Z The application of the permutation test on genome wide expression analysis Chan, Timothy We are now in a new era. The recent completion of the entire sequence of the human genome and high-throughput gene expression technologies has transformed the era of molecular biology to the era of genomics. Already, such technologies are showing great promise in disease classification and gene targets. However, like any new exciting technology, great promise and anticipation can lead to wasted resources and false hope. It is critical that we recognize the experimental limitations of these new technologies and most importantly, hidden problems must be addressed. The primary goal of a high-throughput gene expression experiment is to identify genes of interest that are differentially expressed between two sample groups. This thesis addresses two key issues that have hindered high-throughput gene expression technologies. The first is the sample size issue. Small sample sizes affect statistical confidence and are much more sensitive to outliers. Thus, we show that by using a nonparametric statistical test known as the permutation test, we can achieve higher accuracy than conventional parametric statistical tests such as the t-test. The second issue we address is the use of housekeeping genes for normalization of mRNA levels. It is well known that many biological experiments require a set of reference genes that are highly expressed and constant from sample to sample. The choice of reference genes is critical as the wrong choice can have dire effects on subsequent analyses. To address this issue, we developed a methodology based on SAGE, which is a genome wide expression technology that does not require normalization. Our results suggest that reference genes chosen by our methodology are more appropriate for mRNA normalization than the standard set of housekeeping genes. Furthermore, our results suggest that reference genes are more effective if chosen in a tissue-specific manner. Science, Faculty of Computer Science, Department of Graduate 2010-01-06T22:38:58Z 2010-01-06T22:38:58Z 2006 2006-05 Text Thesis/Dissertation http://hdl.handle.net/2429/17660 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
collection NDLTD
language English
sources NDLTD
description We are now in a new era. The recent completion of the entire sequence of the human genome and high-throughput gene expression technologies has transformed the era of molecular biology to the era of genomics. Already, such technologies are showing great promise in disease classification and gene targets. However, like any new exciting technology, great promise and anticipation can lead to wasted resources and false hope. It is critical that we recognize the experimental limitations of these new technologies and most importantly, hidden problems must be addressed. The primary goal of a high-throughput gene expression experiment is to identify genes of interest that are differentially expressed between two sample groups. This thesis addresses two key issues that have hindered high-throughput gene expression technologies. The first is the sample size issue. Small sample sizes affect statistical confidence and are much more sensitive to outliers. Thus, we show that by using a nonparametric statistical test known as the permutation test, we can achieve higher accuracy than conventional parametric statistical tests such as the t-test. The second issue we address is the use of housekeeping genes for normalization of mRNA levels. It is well known that many biological experiments require a set of reference genes that are highly expressed and constant from sample to sample. The choice of reference genes is critical as the wrong choice can have dire effects on subsequent analyses. To address this issue, we developed a methodology based on SAGE, which is a genome wide expression technology that does not require normalization. Our results suggest that reference genes chosen by our methodology are more appropriate for mRNA normalization than the standard set of housekeeping genes. Furthermore, our results suggest that reference genes are more effective if chosen in a tissue-specific manner. === Science, Faculty of === Computer Science, Department of === Graduate
author Chan, Timothy
spellingShingle Chan, Timothy
The application of the permutation test on genome wide expression analysis
author_facet Chan, Timothy
author_sort Chan, Timothy
title The application of the permutation test on genome wide expression analysis
title_short The application of the permutation test on genome wide expression analysis
title_full The application of the permutation test on genome wide expression analysis
title_fullStr The application of the permutation test on genome wide expression analysis
title_full_unstemmed The application of the permutation test on genome wide expression analysis
title_sort application of the permutation test on genome wide expression analysis
publishDate 2010
url http://hdl.handle.net/2429/17660
work_keys_str_mv AT chantimothy theapplicationofthepermutationtestongenomewideexpressionanalysis
AT chantimothy applicationofthepermutationtestongenomewideexpressionanalysis
_version_ 1718590597227347968