Genome-Wide Identification of Microsatellites and Transposable Elements in the Dromedary Camel Genome Using Whole-Genome Sequencing Data

Transposable elements (TEs) along with simple sequence repeats (SSRs) are prevalent in eukaryotic genome, especially in mammals. Repetitive sequences form approximately one-third of the camelid genomes, so study on this part of genome can be helpful in providing deeper information from the genome an...

Full description

Bibliographic Details
Main Authors: Reza Khalkhali-Evrigh, Nemat Hedayat-Evrigh, Seyed Hasan Hafezian, Ayoub Farhadi, Mohammad Reza Bakhtiarizadeh
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-07-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2019.00692/full
id doaj-a59cdeb0763d4eb293343929289126a4
record_format Article
spelling doaj-a59cdeb0763d4eb293343929289126a42020-11-25T01:19:58ZengFrontiers Media S.A.Frontiers in Genetics1664-80212019-07-011010.3389/fgene.2019.00692446518Genome-Wide Identification of Microsatellites and Transposable Elements in the Dromedary Camel Genome Using Whole-Genome Sequencing DataReza Khalkhali-Evrigh0Nemat Hedayat-Evrigh1Seyed Hasan Hafezian2Ayoub Farhadi3Mohammad Reza Bakhtiarizadeh4Department of Animal Breeding and Genetics, Sari Agricultural Sciences and Natural Resources University, Sari, IranDepartment of Animal Science, University of Mohaghegh Ardabili, Ardabil, IranDepartment of Animal Breeding and Genetics, Sari Agricultural Sciences and Natural Resources University, Sari, IranDepartment of Animal Breeding and Genetics, Sari Agricultural Sciences and Natural Resources University, Sari, IranDepartment of Animal and Poultry Science, College of Aburaihan, University of Tehran, Tehran, IranTransposable elements (TEs) along with simple sequence repeats (SSRs) are prevalent in eukaryotic genome, especially in mammals. Repetitive sequences form approximately one-third of the camelid genomes, so study on this part of genome can be helpful in providing deeper information from the genome and its evolutionary path. Here, in order to improve our understanding regarding the camel genome architecture, the whole genome of the two dromedaries (Yazdi and Trodi camels) was sequenced. Totally, 92- and 84.3-Gb sequence data were obtained and assembled to 137,772 and 149,997 contigs with a N50 length of 54,626 and 54,031 bp in Yazdi and Trodi camels, respectively. Results showed that 30.58% of Yazdi camel genome and 30.50% of Trodi camel genome were covered by TEs. Contrary to the observed results in the genomes of cattle, sheep, horse, and pig, no endogenous retrovirus-K (ERVK) elements were found in the camel genome. Distribution pattern of DNA transposons in the genomes of dromedary, Bactrian, and cattle was similar in contrast with LINE, SINE, and long terminal repeat (LTR) families. Elements like RTE-BovB belonging to LINEs family in cattle and sheep genomes are dramatically higher than genome of dromedary. However, LINE1 (L1) and LINE2 (L2) elements cover higher percentage of LINE family in dromedary genome compared to genome of cattle. Also, 540,133 and 539,409 microsatellites were identified from the assembled contigs of Yazdi and Trodi dromedary camels, respectively. In both samples, di-(393,196) and tri-(65,313) nucleotide repeats contributed to about 42.5% of the microsatellites. The findings of the present study revealed that non-repetitive content of mammalian genomes is approximately similar. Results showed that 9.1 Mb (0.47% of whole assembled genome) of Iranian dromedary’s genome length is made up of SSRs. Annotation of repetitive content of Iranian dromedary camel genome revealed that 9,068 and 11,544 genes contain different types of TEs and SSRs, respectively. SSR markers identified in the present study can be used as a valuable resource for genetic diversity investigations and marker-assisted selection (MAS) in camel-breeding programs.https://www.frontiersin.org/article/10.3389/fgene.2019.00692/fullCamelus dromedariusde novo assemblyrepetitive sequencebreeding strategiesnext-generation sequencing
collection DOAJ
language English
format Article
sources DOAJ
author Reza Khalkhali-Evrigh
Nemat Hedayat-Evrigh
Seyed Hasan Hafezian
Ayoub Farhadi
Mohammad Reza Bakhtiarizadeh
spellingShingle Reza Khalkhali-Evrigh
Nemat Hedayat-Evrigh
Seyed Hasan Hafezian
Ayoub Farhadi
Mohammad Reza Bakhtiarizadeh
Genome-Wide Identification of Microsatellites and Transposable Elements in the Dromedary Camel Genome Using Whole-Genome Sequencing Data
Frontiers in Genetics
Camelus dromedarius
de novo assembly
repetitive sequence
breeding strategies
next-generation sequencing
author_facet Reza Khalkhali-Evrigh
Nemat Hedayat-Evrigh
Seyed Hasan Hafezian
Ayoub Farhadi
Mohammad Reza Bakhtiarizadeh
author_sort Reza Khalkhali-Evrigh
title Genome-Wide Identification of Microsatellites and Transposable Elements in the Dromedary Camel Genome Using Whole-Genome Sequencing Data
title_short Genome-Wide Identification of Microsatellites and Transposable Elements in the Dromedary Camel Genome Using Whole-Genome Sequencing Data
title_full Genome-Wide Identification of Microsatellites and Transposable Elements in the Dromedary Camel Genome Using Whole-Genome Sequencing Data
title_fullStr Genome-Wide Identification of Microsatellites and Transposable Elements in the Dromedary Camel Genome Using Whole-Genome Sequencing Data
title_full_unstemmed Genome-Wide Identification of Microsatellites and Transposable Elements in the Dromedary Camel Genome Using Whole-Genome Sequencing Data
title_sort genome-wide identification of microsatellites and transposable elements in the dromedary camel genome using whole-genome sequencing data
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2019-07-01
description Transposable elements (TEs) along with simple sequence repeats (SSRs) are prevalent in eukaryotic genome, especially in mammals. Repetitive sequences form approximately one-third of the camelid genomes, so study on this part of genome can be helpful in providing deeper information from the genome and its evolutionary path. Here, in order to improve our understanding regarding the camel genome architecture, the whole genome of the two dromedaries (Yazdi and Trodi camels) was sequenced. Totally, 92- and 84.3-Gb sequence data were obtained and assembled to 137,772 and 149,997 contigs with a N50 length of 54,626 and 54,031 bp in Yazdi and Trodi camels, respectively. Results showed that 30.58% of Yazdi camel genome and 30.50% of Trodi camel genome were covered by TEs. Contrary to the observed results in the genomes of cattle, sheep, horse, and pig, no endogenous retrovirus-K (ERVK) elements were found in the camel genome. Distribution pattern of DNA transposons in the genomes of dromedary, Bactrian, and cattle was similar in contrast with LINE, SINE, and long terminal repeat (LTR) families. Elements like RTE-BovB belonging to LINEs family in cattle and sheep genomes are dramatically higher than genome of dromedary. However, LINE1 (L1) and LINE2 (L2) elements cover higher percentage of LINE family in dromedary genome compared to genome of cattle. Also, 540,133 and 539,409 microsatellites were identified from the assembled contigs of Yazdi and Trodi dromedary camels, respectively. In both samples, di-(393,196) and tri-(65,313) nucleotide repeats contributed to about 42.5% of the microsatellites. The findings of the present study revealed that non-repetitive content of mammalian genomes is approximately similar. Results showed that 9.1 Mb (0.47% of whole assembled genome) of Iranian dromedary’s genome length is made up of SSRs. Annotation of repetitive content of Iranian dromedary camel genome revealed that 9,068 and 11,544 genes contain different types of TEs and SSRs, respectively. SSR markers identified in the present study can be used as a valuable resource for genetic diversity investigations and marker-assisted selection (MAS) in camel-breeding programs.
topic Camelus dromedarius
de novo assembly
repetitive sequence
breeding strategies
next-generation sequencing
url https://www.frontiersin.org/article/10.3389/fgene.2019.00692/full
work_keys_str_mv AT rezakhalkhalievrigh genomewideidentificationofmicrosatellitesandtransposableelementsinthedromedarycamelgenomeusingwholegenomesequencingdata
AT nemathedayatevrigh genomewideidentificationofmicrosatellitesandtransposableelementsinthedromedarycamelgenomeusingwholegenomesequencingdata
AT seyedhasanhafezian genomewideidentificationofmicrosatellitesandtransposableelementsinthedromedarycamelgenomeusingwholegenomesequencingdata
AT ayoubfarhadi genomewideidentificationofmicrosatellitesandtransposableelementsinthedromedarycamelgenomeusingwholegenomesequencingdata
AT mohammadrezabakhtiarizadeh genomewideidentificationofmicrosatellitesandtransposableelementsinthedromedarycamelgenomeusingwholegenomesequencingdata
_version_ 1725136211290357760