SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.
We have sequenced the whole genomes of eight proven Holstein bulls from the four half-sib or full-sib families with extremely high and low estimated breeding values (EBV) for milk protein percentage (PP) and fat percentage (FP) using Illumina re-sequencing technology. Consequently, 2.3 billion raw r...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2019-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0220629 |
id |
doaj-f9b6ca71ac46497ea146c7f11737d7e2 |
---|---|
record_format |
Article |
spelling |
doaj-f9b6ca71ac46497ea146c7f11737d7e22021-03-03T21:09:08ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01148e022062910.1371/journal.pone.0220629SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values.Shan LinHongyan ZhangYali HouLin LiuWenhui LiJianping JiangBo HanShengli ZhangDongxiao SunWe have sequenced the whole genomes of eight proven Holstein bulls from the four half-sib or full-sib families with extremely high and low estimated breeding values (EBV) for milk protein percentage (PP) and fat percentage (FP) using Illumina re-sequencing technology. Consequently, 2.3 billion raw reads were obtained with an average effective depth of 8.1×. After single nucleotide variant (SNV) calling, total 10,961,243 SNVs were identified, and 57,451 of them showed opposite fixed sites between the bulls with high and low EBVs within each family (called as common differential SNVs). Next, we annotated the common differential SNVs based on the bovine reference genome, and observed that 45,188 SNVs (78.70%) were located in the intergenic region of genes and merely 11,871 SNVs (20.67%) located within the protein-coding genes. Of them, 13,099 common differential SNVs that were within or close to protein-coding genes with less than 5 kb were chosen for identification of candidate genes for milk compositions in dairy cattle. By integrated analysis of the 2,657 genes with the GO terms and pathways related to protein and fat metabolism, and the known quantitative trait loci (QTLs) for milk protein and fat traits, we identified 17 promising candidate genes: ALG14, ATP2C1, PLD1, C3H1orf85, SNX7, MTHFD2L, CDKN2D, COL5A3, FDX1L, PIN1, FIG4, EXOC7, LASP1, PGS1, SAO, GPLD1 and MGEA5. Our findings provided an important foundation for further study and a prompt for molecular breeding of dairy cattle.https://doi.org/10.1371/journal.pone.0220629 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Shan Lin Hongyan Zhang Yali Hou Lin Liu Wenhui Li Jianping Jiang Bo Han Shengli Zhang Dongxiao Sun |
spellingShingle |
Shan Lin Hongyan Zhang Yali Hou Lin Liu Wenhui Li Jianping Jiang Bo Han Shengli Zhang Dongxiao Sun SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values. PLoS ONE |
author_facet |
Shan Lin Hongyan Zhang Yali Hou Lin Liu Wenhui Li Jianping Jiang Bo Han Shengli Zhang Dongxiao Sun |
author_sort |
Shan Lin |
title |
SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values. |
title_short |
SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values. |
title_full |
SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values. |
title_fullStr |
SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values. |
title_full_unstemmed |
SNV discovery and functional candidate gene identification for milk composition based on whole genome resequencing of Holstein bulls with extremely high and low breeding values. |
title_sort |
snv discovery and functional candidate gene identification for milk composition based on whole genome resequencing of holstein bulls with extremely high and low breeding values. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2019-01-01 |
description |
We have sequenced the whole genomes of eight proven Holstein bulls from the four half-sib or full-sib families with extremely high and low estimated breeding values (EBV) for milk protein percentage (PP) and fat percentage (FP) using Illumina re-sequencing technology. Consequently, 2.3 billion raw reads were obtained with an average effective depth of 8.1×. After single nucleotide variant (SNV) calling, total 10,961,243 SNVs were identified, and 57,451 of them showed opposite fixed sites between the bulls with high and low EBVs within each family (called as common differential SNVs). Next, we annotated the common differential SNVs based on the bovine reference genome, and observed that 45,188 SNVs (78.70%) were located in the intergenic region of genes and merely 11,871 SNVs (20.67%) located within the protein-coding genes. Of them, 13,099 common differential SNVs that were within or close to protein-coding genes with less than 5 kb were chosen for identification of candidate genes for milk compositions in dairy cattle. By integrated analysis of the 2,657 genes with the GO terms and pathways related to protein and fat metabolism, and the known quantitative trait loci (QTLs) for milk protein and fat traits, we identified 17 promising candidate genes: ALG14, ATP2C1, PLD1, C3H1orf85, SNX7, MTHFD2L, CDKN2D, COL5A3, FDX1L, PIN1, FIG4, EXOC7, LASP1, PGS1, SAO, GPLD1 and MGEA5. Our findings provided an important foundation for further study and a prompt for molecular breeding of dairy cattle. |
url |
https://doi.org/10.1371/journal.pone.0220629 |
work_keys_str_mv |
AT shanlin snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues AT hongyanzhang snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues AT yalihou snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues AT linliu snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues AT wenhuili snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues AT jianpingjiang snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues AT bohan snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues AT shenglizhang snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues AT dongxiaosun snvdiscoveryandfunctionalcandidategeneidentificationformilkcompositionbasedonwholegenomeresequencingofholsteinbullswithextremelyhighandlowbreedingvalues |
_version_ |
1714818618244464640 |