The application of the permutation test on genome wide expression analysis

We are now in a new era. The recent completion of the entire sequence of the human genome and high-throughput gene expression technologies has transformed the era of molecular biology to the era of genomics. Already, such technologies are showing great promise in disease classification and gene targ...

Full description

Bibliographic Details
Main Author:	Chan, Timothy
Language:	English
Published:	2010
Online Access:	http://hdl.handle.net/2429/17660

id	ndltd-UBC-oai-circle.library.ubc.ca-2429-17660
record_format	oai_dc
spelling	ndltd-UBC-oai-circle.library.ubc.ca-2429-176602018-01-05T17:39:00Z The application of the permutation test on genome wide expression analysis Chan, Timothy We are now in a new era. The recent completion of the entire sequence of the human genome and high-throughput gene expression technologies has transformed the era of molecular biology to the era of genomics. Already, such technologies are showing great promise in disease classification and gene targets. However, like any new exciting technology, great promise and anticipation can lead to wasted resources and false hope. It is critical that we recognize the experimental limitations of these new technologies and most importantly, hidden problems must be addressed. The primary goal of a high-throughput gene expression experiment is to identify genes of interest that are differentially expressed between two sample groups. This thesis addresses two key issues that have hindered high-throughput gene expression technologies. The first is the sample size issue. Small sample sizes affect statistical confidence and are much more sensitive to outliers. Thus, we show that by using a nonparametric statistical test known as the permutation test, we can achieve higher accuracy than conventional parametric statistical tests such as the t-test. The second issue we address is the use of housekeeping genes for normalization of mRNA levels. It is well known that many biological experiments require a set of reference genes that are highly expressed and constant from sample to sample. The choice of reference genes is critical as the wrong choice can have dire effects on subsequent analyses. To address this issue, we developed a methodology based on SAGE, which is a genome wide expression technology that does not require normalization. Our results suggest that reference genes chosen by our methodology are more appropriate for mRNA normalization than the standard set of housekeeping genes. Furthermore, our results suggest that reference genes are more effective if chosen in a tissue-specific manner. Science, Faculty of Computer Science, Department of Graduate 2010-01-06T22:38:58Z 2010-01-06T22:38:58Z 2006 2006-05 Text Thesis/Dissertation http://hdl.handle.net/2429/17660 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
collection	NDLTD
language	English
sources	NDLTD
description	We are now in a new era. The recent completion of the entire sequence of the human genome and high-throughput gene expression technologies has transformed the era of molecular biology to the era of genomics. Already, such technologies are showing great promise in disease classification and gene targets. However, like any new exciting technology, great promise and anticipation can lead to wasted resources and false hope. It is critical that we recognize the experimental limitations of these new technologies and most importantly, hidden problems must be addressed. The primary goal of a high-throughput gene expression experiment is to identify genes of interest that are differentially expressed between two sample groups. This thesis addresses two key issues that have hindered high-throughput gene expression technologies. The first is the sample size issue. Small sample sizes affect statistical confidence and are much more sensitive to outliers. Thus, we show that by using a nonparametric statistical test known as the permutation test, we can achieve higher accuracy than conventional parametric statistical tests such as the t-test. The second issue we address is the use of housekeeping genes for normalization of mRNA levels. It is well known that many biological experiments require a set of reference genes that are highly expressed and constant from sample to sample. The choice of reference genes is critical as the wrong choice can have dire effects on subsequent analyses. To address this issue, we developed a methodology based on SAGE, which is a genome wide expression technology that does not require normalization. Our results suggest that reference genes chosen by our methodology are more appropriate for mRNA normalization than the standard set of housekeeping genes. Furthermore, our results suggest that reference genes are more effective if chosen in a tissue-specific manner. === Science, Faculty of === Computer Science, Department of === Graduate
author	Chan, Timothy
spellingShingle	Chan, Timothy The application of the permutation test on genome wide expression analysis
author_facet	Chan, Timothy
author_sort	Chan, Timothy
title	The application of the permutation test on genome wide expression analysis
title_short	The application of the permutation test on genome wide expression analysis
title_full	The application of the permutation test on genome wide expression analysis
title_fullStr	The application of the permutation test on genome wide expression analysis
title_full_unstemmed	The application of the permutation test on genome wide expression analysis
title_sort	application of the permutation test on genome wide expression analysis
publishDate	2010
url	http://hdl.handle.net/2429/17660
work_keys_str_mv	AT chantimothy theapplicationofthepermutationtestongenomewideexpressionanalysis AT chantimothy applicationofthepermutationtestongenomewideexpressionanalysis
_version_	1718590597227347968

The application of the permutation test on genome wide expression analysis

Similar Items