Evolutionarily conserved regulatory programs

Despite the diversity of metazoans, common biochemical systems and structures can be found in distinct taxonomic groups. The development and formation of metazoan tissues and structures have been well researched, but their regulatory mechanisms are not well understood. To this end, we implemented bio...


Bibliographic Details
Main Author: Kwon, Tae-Jun Andrew
Language: English
Published: University of British Columbia 2011
Online Access: http://hdl.handle.net/2429/37062
id ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-37062
record_format oai_dc
spelling ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-37062 2014-03-26T03:38:08Z Evolutionarily conserved regulatory programs Kwon, Tae-Jun Andrew 2011-09-01T15:03:59Z 2011-09-01T15:03:59Z 2011 2011-09-01 2011-11 Electronic Thesis or Dissertation http://hdl.handle.net/2429/37062 eng University of British Columbia
collection NDLTD
language English
sources NDLTD
description Despite the diversity of metazoans, common biochemical systems and structures can be found in distinct taxonomic groups. The development and formation of metazoan tissues and structures have been well researched, but their regulatory mechanisms are not well understood. To this end, we implemented bioinformatics tools for regulatory mechanism analysis and applied them to study regulatory program conservation, with an emphasis on muscle development. We first performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs) using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for their capacity to direct selective reporter gene expression to differentiated myotubes. A subset of 19 CRMs was validated as functional in the assay. The rate of predictive success reveals striking limitations of computational CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition and ChIP-Seq evidence as important characteristics to incorporate in future methods for improved predictive specificity. In studying transcriptional regulation, motif enrichment analysis of co-expressed genes is often employed to determine the mediating transcription factors (TFs). We built oPOSSUM-3, a web-based software system for identification of enriched transcription factor binding sites (TFBS) and TFBS families in DNA sequences of co-expressed genes and in sequences generated from high-throughput methods, such as ChIP-Seq. Validation of the system with known sets of published data demonstrates the capacity of oPOSSUM-3 to identify mediating TFs for co-regulated genes. Studies have shown that TF binding profiles tend to be highly conserved over long evolutionary distances. In large-scale public genome annotation projects, such as modENCODE, transcriptional regulation data is compiled for comparative genomics research. Using the oPOSSUM-3 system and published data, we performed comparative analyses of the regulatory programs across evolutionarily divergent species, including human, fruit fly, and nematode, and examined the extent of conservation in major regulatory programs. The thesis research provides new approaches to computational analysis of DNA sequences and insights into the analysis of transcription regulation across the phylogenetic spectrum.
author Kwon, Tae-Jun Andrew
title Evolutionarily conserved regulatory programs
publisher University of British Columbia
publishDate 2011
url http://hdl.handle.net/2429/37062