First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations]

Background: Hilsa shad (Tenualosa ilisha), a widely distributed migratory fish, contributes substantially to the economy of Bangladesh. The harvest of hilsa from inland waters has been fluctuating due to anthropogenic and climate change-induced degradation of the riverine habitats. The whole geno...

Bibliographic Details
Main Authors: Md. Bazlur Rahman Mollah, Mohd Golam Quader Khan, Md Shahidul Islam, Md Samsul Alam
Format: Article
Language: English
Published: F1000 Research Ltd 2019-03-01
Series: F1000Research
Online Access: https://f1000research.com/articles/8-320/v1
id doaj-f16514ef19f849809d5e0fa0729aae63
record_format Article
spelling doaj-f16514ef19f849809d5e0fa0729aae63
2020-11-25T02:59:45Z
eng
F1000 Research Ltd
F1000Research
2046-1402
2019-03-01
8
10.12688/f1000research.18325.1
20044
Md. Bazlur Rahman Mollah (0): Poultry Biotechnology and Genomics Laboratory, Department of Poultry Science, Bangladesh Agricultural University, Mymensingh, 2202, Bangladesh
Mohd Golam Quader Khan (1): Department of Fisheries Biology and Genetics, Bangladesh Agricultural University, Mymensingh, 2202, Bangladesh
Md Shahidul Islam (2): Department of Biotechnology, Bangladesh Agricultural University, Mymensingh, 2202, Bangladesh
Md Samsul Alam (3): Department of Fisheries Biology and Genetics, Bangladesh Agricultural University, Mymensingh, 2202, Bangladesh
collection DOAJ
language English
format Article
sources DOAJ
author Md. Bazlur Rahman Mollah
Mohd Golam Quader Khan
Md Shahidul Islam
Md Samsul Alam
title First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations]
publisher F1000 Research Ltd
series F1000Research
issn 2046-1402
publishDate 2019-03-01
description Background: Hilsa shad (Tenualosa ilisha), a widely distributed migratory fish, contributes substantially to the economy of Bangladesh. The harvest of hilsa from inland waters has been fluctuating due to anthropogenic and climate change-induced degradation of the riverine habitats. The whole-genome sequence of this valuable fish could provide genomic tools for sustainable harvest, conservation and productivity cycle maintenance. Here, we report the first draft genome of T. ilisha from the Bay of Bengal, the largest reservoir of this migratory fish.
Methods: A live specimen of T. ilisha was collected from the Bay of Bengal. Whole-genome sequencing was performed on the Illumina HiSeqX platform (2 × 150 paired-end configuration). We assembled the short reads using the SOAPdenovo2 genome assembler and predicted protein-coding genes with AUGUSTUS. The completeness of the T. ilisha genome assembly was evaluated with BUSCO (Benchmarking Universal Single-Copy Orthologs). We identified single nucleotide polymorphisms (SNPs) by calling them directly from unassembled sequence reads using discoSnp++.
Results: We assembled a draft genome of 710.28 Mb with an N50 scaffold length of 64,157 bp and a GC content of 42.95%. A total of 37,450 protein-coding genes were predicted, of which 29,339 (78.34%) were annotated against other vertebrate genomes. We also identified 792,939 isolated SNPs with a transversion:transition ratio of 1:1.8. The BUSCO evaluation showed 78.1% completeness of this genome.
Conclusions: The genomic data generated in this study could be used as a reference to identify genes associated with physiological and ecological adaptations, population connectivity, and migration behaviour of this biologically and economically important anadromous fish species of the Clupeidae family.
url https://f1000research.com/articles/8-320/v1
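
The description quotes two summary figures that are easy to misread: the N50 scaffold length and the share of predicted genes that were annotated. The sketch below shows how such figures are conventionally computed. It is an illustration only, not the authors' pipeline: the function names `n50` and `annotated_percent` and the toy scaffold lengths are hypothetical, and only the gene counts (29,339 of 37,450) come from the record.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L of the sequence at which, going from longest
    to shortest, the running total first reaches half of the summed
    length of all sequences.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("n50() needs at least one length")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


def annotated_percent(annotated, predicted):
    """Percentage of predicted genes that received an annotation."""
    return round(100 * annotated / predicted, 2)


# Hypothetical toy scaffold lengths (not data from the study).
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_scaffolds)

# Gene counts as reported in the description.
share = annotated_percent(29_339, 37_450)

print(toy_n50, share)
```

For the toy lengths the total is 300, so the running sum reaches the halfway mark of 150 at the second-longest scaffold and the N50 is 70. The gene counts reproduce the 78.34% given in the description.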