Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle.

The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of "missing heritability"--the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants...

Full description

Bibliographic Details
Main Authors: Oscar Gonzalez-Recio, Hans D Daetwyler, Iona M MacLeod, Jennie E Pryce, Phil J Bowman, Ben J Hayes, Michael E Goddard
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4671594?pdf=render
id doaj-a8483c3228154c3abfa24894305e03ba
record_format Article
spelling doaj-a8483c3228154c3abfa24894305e03ba2020-11-24T21:39:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-011012e014394510.1371/journal.pone.0143945Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle.Oscar Gonzalez-RecioHans D DaetwylerIona M MacLeodJennie E PrycePhil J BowmanBen J HayesMichael E GoddardThe proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of "missing heritability"--the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF>0.05), 102,549 uncommon variants (0.01<MAF<0.05), and 83,856 rare variants (MAF<0.01). We describe a novel method for estimating the proportion of the rare variants that are sequencing errors using parent-progeny duos. We then used mixed model methodology to estimate the proportion of variance captured by these different classes of variants for fat, milk and protein yields, as well as for fertility. Common sequence variants captured 83%, 77%, 76% and 84% of the total genetic variance for fat, milk, and protein yields and fertility, respectively. This was between 2 and 5% more variance than that captured from 600k SNPs on a high density chip, although the difference was not significant. Rare variants captured 3%, 0%, 1% and 14% of the genetic variance for fat, milk and protein yields, and fertility respectively, whereas pedigree explained the remaining amount of genetic variance (none for fertility). The proportion of variation explained by rare variants is likely to be under-estimated due to reduced accuracies of imputation for this class of variants. Using common sequence variants slightly improved accuracy of genomic predictions for fat and milk yield, compared to high density SNP array genotypes. However, including rare variants from transcript regions did not increase the accuracy of genomic predictions. These results suggest that rare variants recover a small percentage of the missing heritability for complex traits, however very large reference sets will be required to exploit this to improve the accuracy of genomic predictions. Our results do suggest the contribution of rare variants to genetic variation may be greater for fitness traits.http://europepmc.org/articles/PMC4671594?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Oscar Gonzalez-Recio
Hans D Daetwyler
Iona M MacLeod
Jennie E Pryce
Phil J Bowman
Ben J Hayes
Michael E Goddard
spellingShingle Oscar Gonzalez-Recio
Hans D Daetwyler
Iona M MacLeod
Jennie E Pryce
Phil J Bowman
Ben J Hayes
Michael E Goddard
Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle.
PLoS ONE
author_facet Oscar Gonzalez-Recio
Hans D Daetwyler
Iona M MacLeod
Jennie E Pryce
Phil J Bowman
Ben J Hayes
Michael E Goddard
author_sort Oscar Gonzalez-Recio
title Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle.
title_short Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle.
title_full Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle.
title_fullStr Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle.
title_full_unstemmed Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle.
title_sort rare variants in transcript and potential regulatory regions explain a small percentage of the missing heritability of complex traits in cattle.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2015-01-01
description The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of "missing heritability"--the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF>0.05), 102,549 uncommon variants (0.01<MAF<0.05), and 83,856 rare variants (MAF<0.01). We describe a novel method for estimating the proportion of the rare variants that are sequencing errors using parent-progeny duos. We then used mixed model methodology to estimate the proportion of variance captured by these different classes of variants for fat, milk and protein yields, as well as for fertility. Common sequence variants captured 83%, 77%, 76% and 84% of the total genetic variance for fat, milk, and protein yields and fertility, respectively. This was between 2 and 5% more variance than that captured from 600k SNPs on a high density chip, although the difference was not significant. Rare variants captured 3%, 0%, 1% and 14% of the genetic variance for fat, milk and protein yields, and fertility respectively, whereas pedigree explained the remaining amount of genetic variance (none for fertility). The proportion of variation explained by rare variants is likely to be under-estimated due to reduced accuracies of imputation for this class of variants. Using common sequence variants slightly improved accuracy of genomic predictions for fat and milk yield, compared to high density SNP array genotypes. However, including rare variants from transcript regions did not increase the accuracy of genomic predictions. These results suggest that rare variants recover a small percentage of the missing heritability for complex traits, however very large reference sets will be required to exploit this to improve the accuracy of genomic predictions. Our results do suggest the contribution of rare variants to genetic variation may be greater for fitness traits.
url http://europepmc.org/articles/PMC4671594?pdf=render
work_keys_str_mv AT oscargonzalezrecio rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT hansddaetwyler rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT ionammacleod rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT jennieepryce rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT philjbowman rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT benjhayes rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT michaelegoddard rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
_version_ 1725933190077480960