Divergent genes in gerbils: prevalence, relation to GC-biased substitution, and phenotypic relevance

Abstract Background Two gerbil species, sand rat (Psammomys obesus) and Mongolian jird (Meriones unguiculatus), can become obese and show signs of metabolic dysregulation when maintained on standard laboratory diets. The genetic basis of this phenotype is unknown. Recently, genome sequencing has unc...

Full description

Bibliographic Details
Main Authors: Yichen Dai, Rodrigo Pracana, Peter W. H. Holland
Format: Article
Language:English
Published: BMC 2020-10-01
Series:BMC Evolutionary Biology
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12862-020-01696-3
id doaj-8f2a04a7adf349ff9ec66e38aa6474a9
record_format Article
spelling doaj-8f2a04a7adf349ff9ec66e38aa6474a92021-09-02T17:36:12ZengBMCBMC Evolutionary Biology1471-21482020-10-0120111510.1186/s12862-020-01696-3Divergent genes in gerbils: prevalence, relation to GC-biased substitution, and phenotypic relevanceYichen Dai0Rodrigo Pracana1Peter W. H. Holland2Department of Zoology, University of OxfordDepartment of Zoology, University of OxfordDepartment of Zoology, University of OxfordAbstract Background Two gerbil species, sand rat (Psammomys obesus) and Mongolian jird (Meriones unguiculatus), can become obese and show signs of metabolic dysregulation when maintained on standard laboratory diets. The genetic basis of this phenotype is unknown. Recently, genome sequencing has uncovered very unusual regions of high guanine and cytosine (GC) content scattered across the sand rat genome, most likely generated by extreme and localized biased gene conversion. A key pancreatic transcription factor PDX1 is encoded by a gene in the most extreme GC-rich region, is remarkably divergent and exhibits altered biochemical properties. Here, we ask if gerbils have proteins in addition to PDX1 that are aberrantly divergent in amino acid sequence, whether they have also become divergent due to GC-biased nucleotide changes, and whether these proteins could plausibly be connected to metabolic dysfunction exhibited by gerbils. Results We analyzed ~ 10,000 proteins with 1-to-1 orthologues in human and rodents and identified 50 proteins that accumulated unusually high levels of amino acid change in the sand rat and 41 in Mongolian jird. We show that more than half of the aberrantly divergent proteins are associated with GC biased nucleotide change and many are in previously defined high GC regions. We highlight four aberrantly divergent gerbil proteins, PDX1, INSR, MEDAG and SPP1, that may plausibly be associated with dietary metabolism. Conclusions We show that through the course of gerbil evolution, many aberrantly divergent proteins have accumulated in the gerbil lineage, and GC-biased nucleotide substitution rather than positive selection is the likely cause of extreme divergence in more than half of these. Some proteins carry putatively deleterious changes that could be associated with metabolic and physiological phenotypes observed in some gerbil species. We propose that these animals provide a useful model to study the ‘tug-of-war’ between natural selection and the excessive accumulation of deleterious substitutions mutations through biased gene conversion.http://link.springer.com/article/10.1186/s12862-020-01696-3gBGCGC biasGenome evolutionInsulin receptorMedagMetabolism
collection DOAJ
language English
format Article
sources DOAJ
author Yichen Dai
Rodrigo Pracana
Peter W. H. Holland
spellingShingle Yichen Dai
Rodrigo Pracana
Peter W. H. Holland
Divergent genes in gerbils: prevalence, relation to GC-biased substitution, and phenotypic relevance
BMC Evolutionary Biology
gBGC
GC bias
Genome evolution
Insulin receptor
Medag
Metabolism
author_facet Yichen Dai
Rodrigo Pracana
Peter W. H. Holland
author_sort Yichen Dai
title Divergent genes in gerbils: prevalence, relation to GC-biased substitution, and phenotypic relevance
title_short Divergent genes in gerbils: prevalence, relation to GC-biased substitution, and phenotypic relevance
title_full Divergent genes in gerbils: prevalence, relation to GC-biased substitution, and phenotypic relevance
title_fullStr Divergent genes in gerbils: prevalence, relation to GC-biased substitution, and phenotypic relevance
title_full_unstemmed Divergent genes in gerbils: prevalence, relation to GC-biased substitution, and phenotypic relevance
title_sort divergent genes in gerbils: prevalence, relation to gc-biased substitution, and phenotypic relevance
publisher BMC
series BMC Evolutionary Biology
issn 1471-2148
publishDate 2020-10-01
description Abstract Background Two gerbil species, sand rat (Psammomys obesus) and Mongolian jird (Meriones unguiculatus), can become obese and show signs of metabolic dysregulation when maintained on standard laboratory diets. The genetic basis of this phenotype is unknown. Recently, genome sequencing has uncovered very unusual regions of high guanine and cytosine (GC) content scattered across the sand rat genome, most likely generated by extreme and localized biased gene conversion. A key pancreatic transcription factor PDX1 is encoded by a gene in the most extreme GC-rich region, is remarkably divergent and exhibits altered biochemical properties. Here, we ask if gerbils have proteins in addition to PDX1 that are aberrantly divergent in amino acid sequence, whether they have also become divergent due to GC-biased nucleotide changes, and whether these proteins could plausibly be connected to metabolic dysfunction exhibited by gerbils. Results We analyzed ~ 10,000 proteins with 1-to-1 orthologues in human and rodents and identified 50 proteins that accumulated unusually high levels of amino acid change in the sand rat and 41 in Mongolian jird. We show that more than half of the aberrantly divergent proteins are associated with GC biased nucleotide change and many are in previously defined high GC regions. We highlight four aberrantly divergent gerbil proteins, PDX1, INSR, MEDAG and SPP1, that may plausibly be associated with dietary metabolism. Conclusions We show that through the course of gerbil evolution, many aberrantly divergent proteins have accumulated in the gerbil lineage, and GC-biased nucleotide substitution rather than positive selection is the likely cause of extreme divergence in more than half of these. Some proteins carry putatively deleterious changes that could be associated with metabolic and physiological phenotypes observed in some gerbil species. We propose that these animals provide a useful model to study the ‘tug-of-war’ between natural selection and the excessive accumulation of deleterious substitutions mutations through biased gene conversion.
topic gBGC
GC bias
Genome evolution
Insulin receptor
Medag
Metabolism
url http://link.springer.com/article/10.1186/s12862-020-01696-3
work_keys_str_mv AT yichendai divergentgenesingerbilsprevalencerelationtogcbiasedsubstitutionandphenotypicrelevance
AT rodrigopracana divergentgenesingerbilsprevalencerelationtogcbiasedsubstitutionandphenotypicrelevance
AT peterwhholland divergentgenesingerbilsprevalencerelationtogcbiasedsubstitutionandphenotypicrelevance
_version_ 1721172081318559744