Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle

Abstract Background Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy...

Full description

Bibliographic Details
Main Authors: Toshimi Baba, Sara Pegolo, Lucio F. M. Mota, Francisco Peñagaricano, Giovanni Bittante, Alessio Cecchinato, Gota Morota
Format: Article
Language:deu
Published: BMC 2021-03-01
Series:Genetics Selection Evolution
Online Access:https://doi.org/10.1186/s12711-021-00620-7
id doaj-841a91d42b344662a64d134c7542969c
record_format Article
spelling doaj-841a91d42b344662a64d134c7542969c2021-03-21T12:20:52ZdeuBMCGenetics Selection Evolution1297-96862021-03-0153111410.1186/s12711-021-00620-7Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattleToshimi Baba0Sara Pegolo1Lucio F. M. Mota2Francisco Peñagaricano3Giovanni Bittante4Alessio Cecchinato5Gota Morota6Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State UniversityDepartment of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of PadovaDepartment of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of PadovaDepartment of Animal and Dairy Sciences, University of Wisconsin-MadisonDepartment of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of PadovaDepartment of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of PadovaDepartment of Animal and Poultry Sciences, Virginia Polytechnic Institute and State UniversityAbstract Background Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy for milk protein phenotypes when heterogeneous on-farm, genomic, and pedigree data were integrated with the spectra. To this end, we used the records of 966 Italian Brown Swiss cows with milk FTIR spectra, on-farm information, medium-density genetic markers, and pedigree data. True and total whey protein, and five casein, and two whey protein traits were analyzed. Multiple kernel learning constructed from spectral and genomic (pedigree) relationship matrices and multilayer BayesB assigning separate priors for FTIR and markers were benchmarked against a baseline partial least squares (PLS) regression. Seven combinations of covariates were considered, and their predictive abilities were evaluated by repeated random sub-sampling and herd cross-validations (CV). Results Addition of the on-farm effects such as herd, days in milk, and parity to spectral data improved predictions as compared to those obtained using the spectra alone. Integrating genomics and/or the top three markers with a large effect further enhanced the predictions. Pedigree data also improved prediction, but to a lesser extent than genomic data. Multiple kernel learning and multilayer BayesB increased predictive performance, whereas PLS did not. Overall, multilayer BayesB provided better predictions than multiple kernel learning, and lower prediction performance was observed in herd CV compared to repeated random sub-sampling CV. Conclusions Integration of genomic information with milk FTIR spectral can enhance milk protein trait predictions by 25% and 7% on average for repeated random sub-sampling and herd CV, respectively. Multiple kernel learning and multilayer BayesB outperformed PLS when used to integrate heterogeneous data for phenotypic predictions.https://doi.org/10.1186/s12711-021-00620-7
collection DOAJ
language deu
format Article
sources DOAJ
author Toshimi Baba
Sara Pegolo
Lucio F. M. Mota
Francisco Peñagaricano
Giovanni Bittante
Alessio Cecchinato
Gota Morota
spellingShingle Toshimi Baba
Sara Pegolo
Lucio F. M. Mota
Francisco Peñagaricano
Giovanni Bittante
Alessio Cecchinato
Gota Morota
Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle
Genetics Selection Evolution
author_facet Toshimi Baba
Sara Pegolo
Lucio F. M. Mota
Francisco Peñagaricano
Giovanni Bittante
Alessio Cecchinato
Gota Morota
author_sort Toshimi Baba
title Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle
title_short Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle
title_full Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle
title_fullStr Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle
title_full_unstemmed Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle
title_sort integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle
publisher BMC
series Genetics Selection Evolution
issn 1297-9686
publishDate 2021-03-01
description Abstract Background Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy for milk protein phenotypes when heterogeneous on-farm, genomic, and pedigree data were integrated with the spectra. To this end, we used the records of 966 Italian Brown Swiss cows with milk FTIR spectra, on-farm information, medium-density genetic markers, and pedigree data. True and total whey protein, and five casein, and two whey protein traits were analyzed. Multiple kernel learning constructed from spectral and genomic (pedigree) relationship matrices and multilayer BayesB assigning separate priors for FTIR and markers were benchmarked against a baseline partial least squares (PLS) regression. Seven combinations of covariates were considered, and their predictive abilities were evaluated by repeated random sub-sampling and herd cross-validations (CV). Results Addition of the on-farm effects such as herd, days in milk, and parity to spectral data improved predictions as compared to those obtained using the spectra alone. Integrating genomics and/or the top three markers with a large effect further enhanced the predictions. Pedigree data also improved prediction, but to a lesser extent than genomic data. Multiple kernel learning and multilayer BayesB increased predictive performance, whereas PLS did not. Overall, multilayer BayesB provided better predictions than multiple kernel learning, and lower prediction performance was observed in herd CV compared to repeated random sub-sampling CV. Conclusions Integration of genomic information with milk FTIR spectral can enhance milk protein trait predictions by 25% and 7% on average for repeated random sub-sampling and herd CV, respectively. Multiple kernel learning and multilayer BayesB outperformed PLS when used to integrate heterogeneous data for phenotypic predictions.
url https://doi.org/10.1186/s12711-021-00620-7
work_keys_str_mv AT toshimibaba integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle
AT sarapegolo integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle
AT luciofmmota integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle
AT franciscopenagaricano integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle
AT giovannibittante integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle
AT alessiocecchinato integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle
AT gotamorota integratinggenomicandinfraredspectraldataimprovesthepredictionofmilkproteincompositionindairycattle
_version_ 1724210651273887744