SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.

We have sequenced the whole genomes of eight proven Holstein bulls from the four half-sib or full-sib families with extremely high and low estimated breeding values (EBV) for milk protein percentage (PP) and fat percentage (FP) using Illumina re-sequencing technology. Consequently, 2.3 billion raw r...

Full description

Bibliographic Details
Main Authors: Shan Lin, Hongyan Zhang, Yali Hou, Lin Liu, Wenhui Li, Jianping Jiang, Bo Han, Shengli Zhang, Dongxiao Sun
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0220629
id doaj-f9b6ca71ac46497ea146c7f11737d7e2
record_format Article
spelling doaj-f9b6ca71ac46497ea146c7f11737d7e22021-03-03T21:09:08ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01148e022062910.1371/journal.pone.0220629SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.Shan LinHongyan ZhangYali HouLin LiuWenhui LiJianping JiangBo HanShengli ZhangDongxiao SunWe have sequenced the whole genomes of eight proven Holstein bulls from the four half-sib or full-sib families with extremely high and low estimated breeding values (EBV) for milk protein percentage (PP) and fat percentage (FP) using Illumina re-sequencing technology. Consequently, 2.3 billion raw reads were obtained with an average effective depth of 8.1×. After single nucleotide variant (SNV) calling, total 10,961,243 SNVs were identified, and 57,451 of them showed opposite fixed sites between the bulls with high and low EBVs within each family (called as common differential SNVs). Next, we annotated the common differential SNVs based on the bovine reference genome, and observed that 45,188 SNVs (78.70%) were located in the intergenic region of genes and merely 11,871 SNVs (20.67%) located within the protein-coding genes. Of them, 13,099 common differential SNVs that were within or close to protein-coding genes with less than 5 kb were chosen for identification of candidate genes for milk compositions in dairy cattle. By integrated analysis of the 2,657 genes with the GO terms and pathways related to protein and fat metabolism, and the known quantitative trait loci (QTLs) for milk protein and fat traits, we identified 17 promising candidate genes: ALG14, ATP2C1, PLD1, C3H1orf85, SNX7, MTHFD2L, CDKN2D, COL5A3, FDX1L, PIN1, FIG4, EXOC7, LASP1, PGS1, SAO, GPLD1 and MGEA5. Our findings provided an important foundation for further study and a prompt for molecular breeding of dairy cattle.https://doi.org/10.1371/journal.pone.0220629
collection DOAJ
language English
format Article
sources DOAJ
author Shan Lin
Hongyan Zhang
Yali Hou
Lin Liu
Wenhui Li
Jianping Jiang
Bo Han
Shengli Zhang
Dongxiao Sun
spellingShingle Shan Lin
Hongyan Zhang
Yali Hou
Lin Liu
Wenhui Li
Jianping Jiang
Bo Han
Shengli Zhang
Dongxiao Sun
SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.
PLoS ONE
author_facet Shan Lin
Hongyan Zhang
Yali Hou
Lin Liu
Wenhui Li
Jianping Jiang
Bo Han
Shengli Zhang
Dongxiao Sun
author_sort Shan Lin
title SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.
title_short SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.
title_full SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.
title_fullStr SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.
title_full_unstemmed SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.
title_sort snv discovery and functional candidate gene identification for milk composition based on whole genome resequencing of holstein bulls with extremely high and low breeding values.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2019-01-01
description We have sequenced the whole genomes of eight proven Holstein bulls from the four half-sib or full-sib families with extremely high and low estimated breeding values (EBV) for milk protein percentage (PP) and fat percentage (FP) using Illumina re-sequencing technology. Consequently, 2.3 billion raw reads were obtained with an average effective depth of 8.1×. After single nucleotide variant (SNV) calling, total 10,961,243 SNVs were identified, and 57,451 of them showed opposite fixed sites between the bulls with high and low EBVs within each family (called as common differential SNVs). Next, we annotated the common differential SNVs based on the bovine reference genome, and observed that 45,188 SNVs (78.70%) were located in the intergenic region of genes and merely 11,871 SNVs (20.67%) located within the protein-coding genes. Of them, 13,099 common differential SNVs that were within or close to protein-coding genes with less than 5 kb were chosen for identification of candidate genes for milk compositions in dairy cattle. By integrated analysis of the 2,657 genes with the GO terms and pathways related to protein and fat metabolism, and the known quantitative trait loci (QTLs) for milk protein and fat traits, we identified 17 promising candidate genes: ALG14, ATP2C1, PLD1, C3H1orf85, SNX7, MTHFD2L, CDKN2D, COL5A3, FDX1L, PIN1, FIG4, EXOC7, LASP1, PGS1, SAO, GPLD1 and MGEA5. Our findings provided an important foundation for further study and a prompt for molecular breeding of dairy cattle.
url https://doi.org/10.1371/journal.pone.0220629
work_keys_str_mv AT shanlin snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
AT hongyanzhang snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
AT yalihou snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
AT linliu snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
AT wenhuili snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
AT jianpingjiang snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
AT bohan snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
AT shenglizhang snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
AT dongxiaosun snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues
_version_ 1714818618244464640