High-resolution genome-wide functional dissection of transcriptional regulatory regions and nucleotides in human

Genome-wide epigenomic maps have revealed millions of putative enhancers and promoters, but experimental validation of their function and high-resolution dissection of their driver nucleotides remain limited. Here, we present HiDRA (High-resolution Dissection of Regulatory Activity), a combined expe...

Full description

Bibliographic Details
Main Authors: Goggin, Sarah M. (Author), Saadat, Alham (Author), Wang, Li (Author), Sinnott-Armstrong, Nasa (Author), Wang, Xinchen (Contributor), He, Liang (Contributor), Kellis, Manolis (Contributor), Claussnitzer, Melina (Author)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Department of Biology (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor), Claussnitzer, MelinaChristine (Contributor)
Format: Article
Language:English
Published: Nature Publishing Group, 2019-03-26T15:50:59Z.
Subjects:
Online Access:Get fulltext
Description
Summary:Genome-wide epigenomic maps have revealed millions of putative enhancers and promoters, but experimental validation of their function and high-resolution dissection of their driver nucleotides remain limited. Here, we present HiDRA (High-resolution Dissection of Regulatory Activity), a combined experimental and computational method for high-resolution genome-wide testing and dissection of putative regulatory regions. We test ~7 million accessible DNA fragments in a single experiment, by coupling accessible chromatin extraction with self-transcribing episomal reporters (ATAC-STARR-seq). By design, fragments are highly overlapping in densely-sampled accessible regions, enabling us to pinpoint driver regulatory nucleotides by exploiting differences in activity between partially-overlapping fragments using a machine learning model (SHARPR-RE). In GM12878 lymphoblastoid cells, we find ~65,000 regions showing enhancer function, and pinpoint ~13,000 high-resolution driver elements. These are enriched for regulatory motifs, evolutionarily-conserved nucleotides, and disease-associated genetic variants from genome-wide association studies. Overall, HiDRA provides a high-throughput, high-resolution approach for dissecting regulatory regions and driver nucleotides.
National Institutes of Health (U.S.) (R01 HG008155)
Broad NextGen Award
National Institutes of Health (U.S.) (R01 GM113708)
National Institutes of Health (U.S.) (U01 HG007610)