Integrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysis

The focus for this research is to develop and apply statistical methods to analyze and interpret high-throughput biological data. We developed a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental s...

Full description

Bibliographic Details
Main Author: Yao, Jianchao
Other Authors: Roux, Stanley J.
Format: Others
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/2152/ETD-UT-2009-12-403
id ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2009-12-403
record_format oai_dc
spelling ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2009-12-4032015-09-20T16:54:15ZIntegrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysisYao, Jianchaoshrinkage correlation coefficientmicroarrayRNA-SequencingAPY1APY2The focus for this research is to develop and apply statistical methods to analyze and interpret high-throughput biological data. We developed a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. This computational approach is not only applicable to DNA microarray analysis but is also applicable to proteomics data or any other high-throughput analysis methodology. The suppression of APY1 and APY2 in mutants expressing an inducible RNAi system resulted in plants with a dwarf phenotype and disrupted auxin distribution, and we used these mutants to discover what genes changed expression during growth suppression. We evaluated the gene expression changes of apyrase-suppressed RNAi mutants that had been grown in the light and in the darkness, using the NimbleGen Arabidopsis thaliana 4-Plex microarray, respectively. We compared the two sets of large-scale expression data and identified genes whose expression significantly changed after apyrase suppression in light and darkness, respectively. Our results allowed us to highlight some of the genes likely to play major roles in mediating the growth changes that happen when plants drastically reduce their production of APY1 and APY2, some more associated with growth promotion and others, such as stress-induced genes, more associated with growth inhibition. There is a strong rationale for ranking all these genes as prime candidates for mediating the inhibitory growth effects of suppressing apyrase expression, thus the NimbleGen data will serve as a catalyst and valuable guide to the subsequent physiological and molecular experiments that will be needed to clarify the network of gene expression changes that accompany growth inhibition.textRoux, Stanley J.2010-08-16T18:46:31Z2010-08-16T18:46:37Z2010-08-16T18:46:31Z2010-08-16T18:46:37Z2009-122010-08-16December 20092010-08-16T18:46:37Zthesisapplication/pdfhttp://hdl.handle.net/2152/ETD-UT-2009-12-403eng
collection NDLTD
language English
format Others
sources NDLTD
topic shrinkage correlation coefficient
microarray
RNA-Sequencing
APY1
APY2
spellingShingle shrinkage correlation coefficient
microarray
RNA-Sequencing
APY1
APY2
Yao, Jianchao
Integrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysis
description The focus for this research is to develop and apply statistical methods to analyze and interpret high-throughput biological data. We developed a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. This computational approach is not only applicable to DNA microarray analysis but is also applicable to proteomics data or any other high-throughput analysis methodology. The suppression of APY1 and APY2 in mutants expressing an inducible RNAi system resulted in plants with a dwarf phenotype and disrupted auxin distribution, and we used these mutants to discover what genes changed expression during growth suppression. We evaluated the gene expression changes of apyrase-suppressed RNAi mutants that had been grown in the light and in the darkness, using the NimbleGen Arabidopsis thaliana 4-Plex microarray, respectively. We compared the two sets of large-scale expression data and identified genes whose expression significantly changed after apyrase suppression in light and darkness, respectively. Our results allowed us to highlight some of the genes likely to play major roles in mediating the growth changes that happen when plants drastically reduce their production of APY1 and APY2, some more associated with growth promotion and others, such as stress-induced genes, more associated with growth inhibition. There is a strong rationale for ranking all these genes as prime candidates for mediating the inhibitory growth effects of suppressing apyrase expression, thus the NimbleGen data will serve as a catalyst and valuable guide to the subsequent physiological and molecular experiments that will be needed to clarify the network of gene expression changes that accompany growth inhibition. === text
author2 Roux, Stanley J.
author_facet Roux, Stanley J.
Yao, Jianchao
author Yao, Jianchao
author_sort Yao, Jianchao
title Integrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysis
title_short Integrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysis
title_full Integrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysis
title_fullStr Integrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysis
title_full_unstemmed Integrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysis
title_sort integrative analysis of high-throughput biological data: shrinkage correlation coefficient and comparative expression analysis
publishDate 2010
url http://hdl.handle.net/2152/ETD-UT-2009-12-403
work_keys_str_mv AT yaojianchao integrativeanalysisofhighthroughputbiologicaldatashrinkagecorrelationcoefficientandcomparativeexpressionanalysis
_version_ 1716820882184208384