Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle
Abstract Background Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | deu |
Published: |
BMC
2021-03-01
|
Series: | Genetics Selection Evolution |
Online Access: | https://doi.org/10.1186/s12711-021-00620-7 |
id |
doaj-841a91d42b344662a64d134c7542969c |
---|---|
record_format |
Article |
spelling |
doaj-841a91d42b344662a64d134c7542969c2021-03-21T12:20:52ZdeuBMCGenetics Selection Evolution1297-96862021-03-0153111410.1186/s12711-021-00620-7Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattleToshimi Baba0Sara Pegolo1Lucio F. M. Mota2Francisco Peñagaricano3Giovanni Bittante4Alessio Cecchinato5Gota Morota6Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State UniversityDepartment of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of PadovaDepartment of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of PadovaDepartment of Animal and Dairy Sciences, University of Wisconsin-MadisonDepartment of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of PadovaDepartment of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of PadovaDepartment of Animal and Poultry Sciences, Virginia Polytechnic Institute and State UniversityAbstract Background Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy for milk protein phenotypes when heterogeneous on-farm, genomic, and pedigree data were integrated with the spectra. To this end, we used the records of 966 Italian Brown Swiss cows with milk FTIR spectra, on-farm information, medium-density genetic markers, and pedigree data. True and total whey protein, and five casein, and two whey protein traits were analyzed. Multiple kernel learning constructed from spectral and genomic (pedigree) relationship matrices and multilayer BayesB assigning separate priors for FTIR and markers were benchmarked against a baseline partial least squares (PLS) regression. Seven combinations of covariates were considered, and their predictive abilities were evaluated by repeated random sub-sampling and herd cross-validations (CV). Results Addition of the on-farm effects such as herd, days in milk, and parity to spectral data improved predictions as compared to those obtained using the spectra alone. Integrating genomics and/or the top three markers with a large effect further enhanced the predictions. Pedigree data also improved prediction, but to a lesser extent than genomic data. Multiple kernel learning and multilayer BayesB increased predictive performance, whereas PLS did not. Overall, multilayer BayesB provided better predictions than multiple kernel learning, and lower prediction performance was observed in herd CV compared to repeated random sub-sampling CV. Conclusions Integration of genomic information with milk FTIR spectral can enhance milk protein trait predictions by 25% and 7% on average for repeated random sub-sampling and herd CV, respectively. Multiple kernel learning and multilayer BayesB outperformed PLS when used to integrate heterogeneous data for phenotypic predictions.https://doi.org/10.1186/s12711-021-00620-7 |
collection |
DOAJ |
language |
deu |
format |
Article |
sources |
DOAJ |
author |
Toshimi Baba Sara Pegolo Lucio F. M. Mota Francisco Peñagaricano Giovanni Bittante Alessio Cecchinato Gota Morota |
spellingShingle |
Toshimi Baba Sara Pegolo Lucio F. M. Mota Francisco Peñagaricano Giovanni Bittante Alessio Cecchinato Gota Morota Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle Genetics Selection Evolution |
author_facet |
Toshimi Baba Sara Pegolo Lucio F. M. Mota Francisco Peñagaricano Giovanni Bittante Alessio Cecchinato Gota Morota |
author_sort |
Toshimi Baba |
title |
Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle |
title_short |
Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle |
title_full |
Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle |
title_fullStr |
Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle |
title_full_unstemmed |
Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle |
title_sort |
integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle |
publisher |
BMC |
series |
Genetics Selection Evolution |
issn |
1297-9686 |
publishDate |
2021-03-01 |
description |
Abstract Background Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy for milk protein phenotypes when heterogeneous on-farm, genomic, and pedigree data were integrated with the spectra. To this end, we used the records of 966 Italian Brown Swiss cows with milk FTIR spectra, on-farm information, medium-density genetic markers, and pedigree data. True and total whey protein, and five casein, and two whey protein traits were analyzed. Multiple kernel learning constructed from spectral and genomic (pedigree) relationship matrices and multilayer BayesB assigning separate priors for FTIR and markers were benchmarked against a baseline partial least squares (PLS) regression. Seven combinations of covariates were considered, and their predictive abilities were evaluated by repeated random sub-sampling and herd cross-validations (CV). Results Addition of the on-farm effects such as herd, days in milk, and parity to spectral data improved predictions as compared to those obtained using the spectra alone. Integrating genomics and/or the top three markers with a large effect further enhanced the predictions. Pedigree data also improved prediction, but to a lesser extent than genomic data. Multiple kernel learning and multilayer BayesB increased predictive performance, whereas PLS did not. Overall, multilayer BayesB provided better predictions than multiple kernel learning, and lower prediction performance was observed in herd CV compared to repeated random sub-sampling CV. Conclusions Integration of genomic information with milk FTIR spectral can enhance milk protein trait predictions by 25% and 7% on average for repeated random sub-sampling and herd CV, respectively. Multiple kernel learning and multilayer BayesB outperformed PLS when used to integrate heterogeneous data for phenotypic predictions. |
url |
https://doi.org/10.1186/s12711-021-00620-7 |
work_keys_str_mv |
AT toshimibaba integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle AT sarapegolo integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle AT luciofmmota integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle AT franciscopenagaricano integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle AT giovannibittante integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle AT alessiocecchinato integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle AT gotamorota integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle |
_version_ |
1724210651273887744 |