Comparative analysis of protein coding sequences from human, mouse and the domesticated pig

<p>Abstract</p> <p>Background</p> <p>The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from hu...

Full description

Bibliographic Details
Main Authors: Bendixen Christian, Hornshøj Henrik, Hobolth Asger, Jørgensen Frank, Fredholm Merete, Schierup Mikkel
Format: Article
Language:English
Published: BMC 2005-01-01
Series:BMC Biology
Online Access:http://www.biomedcentral.com/1741-7007/3/2
id doaj-8146923a03874bb4b495abdd20983978
record_format Article
spelling doaj-8146923a03874bb4b495abdd209839782020-11-25T00:37:53ZengBMCBMC Biology1741-70072005-01-0131210.1186/1741-7007-3-2Comparative analysis of protein coding sequences from human, mouse and the domesticated pigBendixen ChristianHornshøj HenrikHobolth AsgerJørgensen FrankFredholm MereteSchierup Mikkel<p>Abstract</p> <p>Background</p> <p>The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish <it>Fugu rubrices </it>in order to investigate 1) the relationships between three major lineages of mammals: rodents, artiodactyls and primates, and 2) the rate of evolution and the occurrence of positive Darwinian selection using codon based models of sequence evolution.</p> <p>Results</p> <p>We provide evidence that the evolutionary splits among primates, rodents and artiodactyls happened shortly after each other, with most gene trees favouring a topology with rodents as outgroup to primates and artiodactyls. Using an unrooted topology of the three mammalian species we show that since their diversification, the pig and mouse lineages have on average experienced 1.44 and 2.86 times as many synonymous substitutions as humans, respectively, whereas the rates of non-synonymous substitutions are more similar. The analysis shows the highest average dN/dS ratio in the human lineage, followed by the pig and then the mouse lineages. Using codon based models we detect signals of positive Darwinian selection in approximately 5.3%, 4.9% and 6.0% of the genes on the human, pig and mouse lineages respectively. Approximately 16.8% of all the genes studied here are not currently annotated as functional genes in humans. Our analyses indicate that a large fraction of these genes may have lost their function quite recently or may still be functional genes in some or all of the three mammalian species.</p> <p>Conclusions</p> <p>We present a comparative analysis of protein coding genes from three major mammalian lineages. Our study demonstrates the usefulness of codon-based likelihood models in detecting selection and it illustrates the value of sequencing organisms at different phylogenetic distances for comparative studies.</p> http://www.biomedcentral.com/1741-7007/3/2
collection DOAJ
language English
format Article
sources DOAJ
author Bendixen Christian
Hornshøj Henrik
Hobolth Asger
Jørgensen Frank
Fredholm Merete
Schierup Mikkel
spellingShingle Bendixen Christian
Hornshøj Henrik
Hobolth Asger
Jørgensen Frank
Fredholm Merete
Schierup Mikkel
Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
BMC Biology
author_facet Bendixen Christian
Hornshøj Henrik
Hobolth Asger
Jørgensen Frank
Fredholm Merete
Schierup Mikkel
author_sort Bendixen Christian
title Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_short Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_full Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_fullStr Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_full_unstemmed Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_sort comparative analysis of protein coding sequences from human, mouse and the domesticated pig
publisher BMC
series BMC Biology
issn 1741-7007
publishDate 2005-01-01
description <p>Abstract</p> <p>Background</p> <p>The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish <it>Fugu rubrices </it>in order to investigate 1) the relationships between three major lineages of mammals: rodents, artiodactyls and primates, and 2) the rate of evolution and the occurrence of positive Darwinian selection using codon based models of sequence evolution.</p> <p>Results</p> <p>We provide evidence that the evolutionary splits among primates, rodents and artiodactyls happened shortly after each other, with most gene trees favouring a topology with rodents as outgroup to primates and artiodactyls. Using an unrooted topology of the three mammalian species we show that since their diversification, the pig and mouse lineages have on average experienced 1.44 and 2.86 times as many synonymous substitutions as humans, respectively, whereas the rates of non-synonymous substitutions are more similar. The analysis shows the highest average dN/dS ratio in the human lineage, followed by the pig and then the mouse lineages. Using codon based models we detect signals of positive Darwinian selection in approximately 5.3%, 4.9% and 6.0% of the genes on the human, pig and mouse lineages respectively. Approximately 16.8% of all the genes studied here are not currently annotated as functional genes in humans. Our analyses indicate that a large fraction of these genes may have lost their function quite recently or may still be functional genes in some or all of the three mammalian species.</p> <p>Conclusions</p> <p>We present a comparative analysis of protein coding genes from three major mammalian lineages. Our study demonstrates the usefulness of codon-based likelihood models in detecting selection and it illustrates the value of sequencing organisms at different phylogenetic distances for comparative studies.</p>
url http://www.biomedcentral.com/1741-7007/3/2
work_keys_str_mv AT bendixenchristian comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT hornshøjhenrik comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT hobolthasger comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT jørgensenfrank comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT fredholmmerete comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT schierupmikkel comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
_version_ 1725299157936111616