Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]

The availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, existing bovine WGS databases do not show data in a form conducive to protein variant analysis, and tend to under represent the breadth of genetic diversity in global beef cattl...

Full description

Bibliographic Details
Main Authors: Michael P. Heaton, Timothy P.L. Smith, Jacky K. Carnahan, Veronica Basnayake, Jiansheng Qiu, Barry Simpson, Theodore S. Kalbfleisch
Format: Article
Language:English
Published: F1000 Research Ltd 2016-10-01
Series:F1000Research
Subjects:
Online Access:http://f1000research.com/articles/5-2003/v2
id doaj-b593d31d6bbc4d7fb4e139f7b5f0556f
record_format Article
spelling doaj-b593d31d6bbc4d7fb4e139f7b5f0556f2020-11-25T03:32:09ZengF1000 Research LtdF1000Research2046-14022016-10-01510.12688/f1000research.9254.210467Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]Michael P. Heaton0Timothy P.L. Smith1Jacky K. Carnahan2Veronica Basnayake3Jiansheng Qiu4Barry Simpson5Theodore S. Kalbfleisch6U.S. Meat Animal Research Center (USMARC), Clay Center, USAU.S. Meat Animal Research Center (USMARC), Clay Center, USAU.S. Meat Animal Research Center (USMARC), Clay Center, USAGeneSeek, a Neogen Company, Lincoln, USAGeneSeek, a Neogen Company, Lincoln, USAGeneSeek, a Neogen Company, Lincoln, USADepartment of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, USAThe availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, existing bovine WGS databases do not show data in a form conducive to protein variant analysis, and tend to under represent the breadth of genetic diversity in global beef cattle. Thus, our first aim was to use 96 beef sires, sharing minimal pedigree relationships, to create a searchable and publicly viewable set of mapped genomes relevant for 19 popular breeds of U.S. cattle. Our second aim was to identify protein variants encoded by the bovine endothelial PAS domain-containing protein 1 gene (EPAS1), a gene associated with pulmonary hypertension in Angus cattle. The identity and quality of genomic sequences were verified by comparing WGS genotypes to those derived from other methods. The average read depth, genotype scoring rate, and genotype accuracy exceeded 14, 99%, and 99%, respectively. The 96 genomes were used to discover four amino acid variants encoded by EPAS1 (E270Q, P362L, A671G, and L701F) and confirm two variants previously associated with disease (A606T and G610S). The six EPAS1 missense mutations were verified with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry assays, and their frequencies were estimated in a separate collection of 1154 U.S. cattle representing 46 breeds. A rooted phylogenetic tree of eight polypeptide sequences provided a framework for evaluating the likely order of mutations and potential impact of EPAS1 alleles on the adaptive response to chronic hypoxia in U.S. cattle. This public, whole genome resource facilitates in silico identification of protein variants in diverse types of U.S. beef cattle, and provides a means of translating WGS data into a practical biological and evolutionary context for generating and testing hypotheses.http://f1000research.com/articles/5-2003/v2Agriculture & BiotechnologyGenomics
collection DOAJ
language English
format Article
sources DOAJ
author Michael P. Heaton
Timothy P.L. Smith
Jacky K. Carnahan
Veronica Basnayake
Jiansheng Qiu
Barry Simpson
Theodore S. Kalbfleisch
spellingShingle Michael P. Heaton
Timothy P.L. Smith
Jacky K. Carnahan
Veronica Basnayake
Jiansheng Qiu
Barry Simpson
Theodore S. Kalbfleisch
Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]
F1000Research
Agriculture & Biotechnology
Genomics
author_facet Michael P. Heaton
Timothy P.L. Smith
Jacky K. Carnahan
Veronica Basnayake
Jiansheng Qiu
Barry Simpson
Theodore S. Kalbfleisch
author_sort Michael P. Heaton
title Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]
title_short Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]
title_full Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]
title_fullStr Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]
title_full_unstemmed Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]
title_sort using diverse u.s. beef cattle genomes to identify missense mutations in epas1, a gene associated with pulmonary hypertension [version 2; referees: 2 approved]
publisher F1000 Research Ltd
series F1000Research
issn 2046-1402
publishDate 2016-10-01
description The availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, existing bovine WGS databases do not show data in a form conducive to protein variant analysis, and tend to under represent the breadth of genetic diversity in global beef cattle. Thus, our first aim was to use 96 beef sires, sharing minimal pedigree relationships, to create a searchable and publicly viewable set of mapped genomes relevant for 19 popular breeds of U.S. cattle. Our second aim was to identify protein variants encoded by the bovine endothelial PAS domain-containing protein 1 gene (EPAS1), a gene associated with pulmonary hypertension in Angus cattle. The identity and quality of genomic sequences were verified by comparing WGS genotypes to those derived from other methods. The average read depth, genotype scoring rate, and genotype accuracy exceeded 14, 99%, and 99%, respectively. The 96 genomes were used to discover four amino acid variants encoded by EPAS1 (E270Q, P362L, A671G, and L701F) and confirm two variants previously associated with disease (A606T and G610S). The six EPAS1 missense mutations were verified with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry assays, and their frequencies were estimated in a separate collection of 1154 U.S. cattle representing 46 breeds. A rooted phylogenetic tree of eight polypeptide sequences provided a framework for evaluating the likely order of mutations and potential impact of EPAS1 alleles on the adaptive response to chronic hypoxia in U.S. cattle. This public, whole genome resource facilitates in silico identification of protein variants in diverse types of U.S. beef cattle, and provides a means of translating WGS data into a practical biological and evolutionary context for generating and testing hypotheses.
topic Agriculture & Biotechnology
Genomics
url http://f1000research.com/articles/5-2003/v2
work_keys_str_mv AT michaelpheaton usingdiverseusbeefcattlegenomestoidentifymissensemutationsinepas1ageneassociatedwithpulmonaryhypertensionversion2referees2approved
AT timothyplsmith usingdiverseusbeefcattlegenomestoidentifymissensemutationsinepas1ageneassociatedwithpulmonaryhypertensionversion2referees2approved
AT jackykcarnahan usingdiverseusbeefcattlegenomestoidentifymissensemutationsinepas1ageneassociatedwithpulmonaryhypertensionversion2referees2approved
AT veronicabasnayake usingdiverseusbeefcattlegenomestoidentifymissensemutationsinepas1ageneassociatedwithpulmonaryhypertensionversion2referees2approved
AT jianshengqiu usingdiverseusbeefcattlegenomestoidentifymissensemutationsinepas1ageneassociatedwithpulmonaryhypertensionversion2referees2approved
AT barrysimpson usingdiverseusbeefcattlegenomestoidentifymissensemutationsinepas1ageneassociatedwithpulmonaryhypertensionversion2referees2approved
AT theodoreskalbfleisch usingdiverseusbeefcattlegenomestoidentifymissensemutationsinepas1ageneassociatedwithpulmonaryhypertensionversion2referees2approved
_version_ 1724569284901863424