Predictive long-range allele-specific mapping of regulatory variants and target transcripts.

Genome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from s...

Full description

Bibliographic Details
Main Authors: Kibaick Lee, Seulkee Lee, Hyoeun Bang, Jung Kyoon Choi
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5391085?pdf=render
id doaj-20caff10aef84f89bebd1efec4430eb2
record_format Article
spelling doaj-20caff10aef84f89bebd1efec4430eb22020-11-25T01:36:40ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01124e017576810.1371/journal.pone.0175768Predictive long-range allele-specific mapping of regulatory variants and target transcripts.Kibaick LeeSeulkee LeeHyoeun BangJung Kyoon ChoiGenome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from sample-to-sample experimental variation and trans-acting or environmental effects. Instead, alleles at heterozygous loci can be compared within a sample, thereby controlling for those confounding factors. Here we introduce a method for chromatin structure-based allele-specific pairing of regulatory variants and target transcripts. With phased genotypes, much of allele-specific expression could be explained by paired allelic cis-regulation across a long range. This approach showed approximately two times greater sensitivity than QTL mapping. There are cases in which allele imbalance cannot be tested because heterozygotes are not available among reference samples. Therefore, we employed a machine learning method to predict missing positive cases based on various features shared by observed allele-specific pairs. We showed that only 10 reference samples are sufficient to achieve high prediction accuracy with a low sampling variation. In conclusion, our method enables highly sensitive fine mapping and target identification for trait-associated variants based on a small number of reference samples.http://europepmc.org/articles/PMC5391085?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Kibaick Lee
Seulkee Lee
Hyoeun Bang
Jung Kyoon Choi
spellingShingle Kibaick Lee
Seulkee Lee
Hyoeun Bang
Jung Kyoon Choi
Predictive long-range allele-specific mapping of regulatory variants and target transcripts.
PLoS ONE
author_facet Kibaick Lee
Seulkee Lee
Hyoeun Bang
Jung Kyoon Choi
author_sort Kibaick Lee
title Predictive long-range allele-specific mapping of regulatory variants and target transcripts.
title_short Predictive long-range allele-specific mapping of regulatory variants and target transcripts.
title_full Predictive long-range allele-specific mapping of regulatory variants and target transcripts.
title_fullStr Predictive long-range allele-specific mapping of regulatory variants and target transcripts.
title_full_unstemmed Predictive long-range allele-specific mapping of regulatory variants and target transcripts.
title_sort predictive long-range allele-specific mapping of regulatory variants and target transcripts.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description Genome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from sample-to-sample experimental variation and trans-acting or environmental effects. Instead, alleles at heterozygous loci can be compared within a sample, thereby controlling for those confounding factors. Here we introduce a method for chromatin structure-based allele-specific pairing of regulatory variants and target transcripts. With phased genotypes, much of allele-specific expression could be explained by paired allelic cis-regulation across a long range. This approach showed approximately two times greater sensitivity than QTL mapping. There are cases in which allele imbalance cannot be tested because heterozygotes are not available among reference samples. Therefore, we employed a machine learning method to predict missing positive cases based on various features shared by observed allele-specific pairs. We showed that only 10 reference samples are sufficient to achieve high prediction accuracy with a low sampling variation. In conclusion, our method enables highly sensitive fine mapping and target identification for trait-associated variants based on a small number of reference samples.
url http://europepmc.org/articles/PMC5391085?pdf=render
work_keys_str_mv AT kibaicklee predictivelongrangeallelespecificmappingofregulatoryvariantsandtargettranscripts
AT seulkeelee predictivelongrangeallelespecificmappingofregulatoryvariantsandtargettranscripts
AT hyoeunbang predictivelongrangeallelespecificmappingofregulatoryvariantsandtargettranscripts
AT jungkyoonchoi predictivelongrangeallelespecificmappingofregulatoryvariantsandtargettranscripts
_version_ 1725061655277076480