First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations]
Background: Hilsa shad (Tenualosa ilisha), a widely distributed migratory fish, contributes substantially to the economy of Bangladesh. The harvest of hilsa from inland waters has been fluctuating due to anthropological and climate change-induced degradation of the riverine habitats. The whole geno...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
F1000 Research Ltd
2019-03-01
|
Series: | F1000Research |
Online Access: | https://f1000research.com/articles/8-320/v1 |
id |
doaj-f16514ef19f849809d5e0fa0729aae63 |
---|---|
record_format |
Article |
spelling |
doaj-f16514ef19f849809d5e0fa0729aae632020-11-25T02:59:45ZengF1000 Research LtdF1000Research2046-14022019-03-01810.12688/f1000research.18325.120044First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations]Md. Bazlur Rahman Mollah0Mohd Golam Quader Khan1Md Shahidul Islam2Md Samsul Alam3Poultry Biotechnology and Genomics Laboratory, Department of Poultry Science, Bangladesh Agricultural University, Mymensingh, 2202, BangladeshDepartment of Fisheries Biology and Genetics, Bangladesh Agricultural University, Mymensingh, 2202, BangladeshDepartment of Biotechnology, Bangladesh Agricultural University, Mymensingh, 2202, BangladeshDepartment of Fisheries Biology and Genetics, Bangladesh Agricultural University, Mymensingh, 2202, BangladeshBackground: Hilsa shad (Tenualosa ilisha), a widely distributed migratory fish, contributes substantially to the economy of Bangladesh. The harvest of hilsa from inland waters has been fluctuating due to anthropological and climate change-induced degradation of the riverine habitats. The whole genome sequence of this valuable fish could provide genomic tools for sustainable harvest, conservation and productivity cycle maintenance. Here, we report the first draft genome of T. ilisha from the Bay of Bengal, the largest reservoir of the migratory fish. Methods: A live specimen of T. ilisha was collected from the Bay of Bengal. The whole genome sequencing was performed by the Illumina HiSeqX platform (2 × 150 paired end configuration). We assembled the short reads using SOAPdenovo2 genome assembler and predicted protein coding genes by AUGUSTUS. The completeness of the T. ilisha genome assembly was evaluated by BUSCO (Benchmarking Universal Single Copy Orthologs). We identified single nucleotide polymorphisms (SNPs) by calling them directly from unassembled sequence reads using discoSnp++. Results: We assembled the draft genome of 710.28 Mb having an N50 scaffold length of 64157 bp and GC content of 42.95%. A total of 37,450 protein coding genes were predicted of which 29,339 (78.34%) were annotated with other vertebrate genomes. We also identified 792,939 isolated SNPs with transversion:transition ratio of 1:1.8. The BUSCO evaluation showed 78.1% completeness of this genome. Conclusions: The genomic data generated in this study could be used as a reference to identify genes associated with physiological and ecological adaptations, population connectivity, and migration behaviour of this biologically and economically important anadromous fish species of the Clupeidae family.https://f1000research.com/articles/8-320/v1 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Md. Bazlur Rahman Mollah Mohd Golam Quader Khan Md Shahidul Islam Md Samsul Alam |
spellingShingle |
Md. Bazlur Rahman Mollah Mohd Golam Quader Khan Md Shahidul Islam Md Samsul Alam First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations] F1000Research |
author_facet |
Md. Bazlur Rahman Mollah Mohd Golam Quader Khan Md Shahidul Islam Md Samsul Alam |
author_sort |
Md. Bazlur Rahman Mollah |
title |
First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations] |
title_short |
First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations] |
title_full |
First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations] |
title_fullStr |
First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations] |
title_full_unstemmed |
First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal [version 1; peer review: 1 approved, 2 approved with reservations] |
title_sort |
first draft genome assembly and identification of snps from hilsa shad (tenualosa ilisha) of the bay of bengal [version 1; peer review: 1 approved, 2 approved with reservations] |
publisher |
F1000 Research Ltd |
series |
F1000Research |
issn |
2046-1402 |
publishDate |
2019-03-01 |
description |
Background: Hilsa shad (Tenualosa ilisha), a widely distributed migratory fish, contributes substantially to the economy of Bangladesh. The harvest of hilsa from inland waters has been fluctuating due to anthropological and climate change-induced degradation of the riverine habitats. The whole genome sequence of this valuable fish could provide genomic tools for sustainable harvest, conservation and productivity cycle maintenance. Here, we report the first draft genome of T. ilisha from the Bay of Bengal, the largest reservoir of the migratory fish. Methods: A live specimen of T. ilisha was collected from the Bay of Bengal. The whole genome sequencing was performed by the Illumina HiSeqX platform (2 × 150 paired end configuration). We assembled the short reads using SOAPdenovo2 genome assembler and predicted protein coding genes by AUGUSTUS. The completeness of the T. ilisha genome assembly was evaluated by BUSCO (Benchmarking Universal Single Copy Orthologs). We identified single nucleotide polymorphisms (SNPs) by calling them directly from unassembled sequence reads using discoSnp++. Results: We assembled the draft genome of 710.28 Mb having an N50 scaffold length of 64157 bp and GC content of 42.95%. A total of 37,450 protein coding genes were predicted of which 29,339 (78.34%) were annotated with other vertebrate genomes. We also identified 792,939 isolated SNPs with transversion:transition ratio of 1:1.8. The BUSCO evaluation showed 78.1% completeness of this genome. Conclusions: The genomic data generated in this study could be used as a reference to identify genes associated with physiological and ecological adaptations, population connectivity, and migration behaviour of this biologically and economically important anadromous fish species of the Clupeidae family. |
url |
https://f1000research.com/articles/8-320/v1 |
work_keys_str_mv |
AT mdbazlurrahmanmollah firstdraftgenomeassemblyandidentificationofsnpsfromhilsashadtenualosailishaofthebayofbengalversion1peerreview1approved2approvedwithreservations AT mohdgolamquaderkhan firstdraftgenomeassemblyandidentificationofsnpsfromhilsashadtenualosailishaofthebayofbengalversion1peerreview1approved2approvedwithreservations AT mdshahidulislam firstdraftgenomeassemblyandidentificationofsnpsfromhilsashadtenualosailishaofthebayofbengalversion1peerreview1approved2approvedwithreservations AT mdsamsulalam firstdraftgenomeassemblyandidentificationofsnpsfromhilsashadtenualosailishaofthebayofbengalversion1peerreview1approved2approvedwithreservations |
_version_ |
1724701216720551936 |