Determination of GC content of Thermotoga maritima, Thermotoga neapolitana and Thermotoga thermarum strains: A GC dataset for higher level hierarchical classification

Complete genome sequences of 16 hyperthermophilic Thermotoga strains, viz. Thermotoga maritima (AE000512, CP004077, CP007013, CP011107, NC_000853, NC_021214, NC_023151, NZ_CP011107, CP011108, NZ_CP011108, CP010967 & NZ_CP010967), Thermotoga neapolitana (CP000916 & NC_011978) and T...

Bibliographic Details
Main Authors: Bhagwan N. Rekadwad, Chandrahasya N. Khobragade
Format: Article
Language: English
Published: Elsevier 2016-09-01
Series: Data in Brief
Online Access: http://www.sciencedirect.com/science/article/pii/S235234091630333X
id doaj-3a7943945f9649d4b0d5e73b49d87c8c
record_format Article
spelling doaj-3a7943945f9649d4b0d5e73b49d87c8c
2020-11-25T01:32:42Z
eng
Elsevier
Data in Brief
2352-3409
2016-09-01
8
300
303
Determination of GC content of Thermotoga maritima, Thermotoga neapolitana and Thermotoga thermarum strains: A GC dataset for higher level hierarchical classification
Bhagwan N. Rekadwad 0
Chandrahasya N. Khobragade 1
Corresponding author.; School of Life Sciences, Swami Ramanand Teerth Marathwada University, Nanded, Maharashtra 431606, India
School of Life Sciences, Swami Ramanand Teerth Marathwada University, Nanded, Maharashtra 431606, India
Complete genome sequences of 16 hyperthermophilic Thermotoga strains, viz. Thermotoga maritima (AE000512, CP004077, CP007013, CP011107, NC_000853, NC_021214, NC_023151, NZ_CP011107, CP011108, NZ_CP011108, CP010967 & NZ_CP010967), Thermotoga neapolitana (CP000916 & NC_011978) and Thermotoga thermarum (CP002351 & NC_015707), were retrieved from the NCBI BioSample database. ENDMEMO GC was used to generate data on GC content in Thermotoga sp. DNA sequences. The maximum GC content was observed in Thermotoga strains AE000512 & NC_000853 (69% GC), followed by NZ_CP011108, CP011108, NZ_CP011107, NC_023151, NC_021214, CP011107 & CP004077 (68.5% GC); NZ_CP010967 & CP010967 (68.3% GC); CP000916, CP007013 & NC_011978 (68% GC); and CP002351 & NC_015707 (67% GC). Used alongside phenotypic and other genotypic characters, GC ratios from this dataset help in higher-level hierarchical classification in bacterial systematics. Keywords: ENDMEMO, GC content, Hyperthermophiles, New digital data, Whole genome
http://www.sciencedirect.com/science/article/pii/S235234091630333X
collection DOAJ
language English
format Article
sources DOAJ
author Bhagwan N. Rekadwad
Chandrahasya N. Khobragade
title Determination of GC content of Thermotoga maritima, Thermotoga neapolitana and Thermotoga thermarum strains: A GC dataset for higher level hierarchical classification
publisher Elsevier
series Data in Brief
issn 2352-3409
publishDate 2016-09-01
description Complete genome sequences of 16 hyperthermophilic Thermotoga strains, viz. Thermotoga maritima (AE000512, CP004077, CP007013, CP011107, NC_000853, NC_021214, NC_023151, NZ_CP011107, CP011108, NZ_CP011108, CP010967 & NZ_CP010967), Thermotoga neapolitana (CP000916 & NC_011978) and Thermotoga thermarum (CP002351 & NC_015707), were retrieved from the NCBI BioSample database. ENDMEMO GC was used to generate data on GC content in Thermotoga sp. DNA sequences. The maximum GC content was observed in Thermotoga strains AE000512 & NC_000853 (69% GC), followed by NZ_CP011108, CP011108, NZ_CP011107, NC_023151, NC_021214, CP011107 & CP004077 (68.5% GC); NZ_CP010967 & CP010967 (68.3% GC); CP000916, CP007013 & NC_011978 (68% GC); and CP002351 & NC_015707 (67% GC). Used alongside phenotypic and other genotypic characters, GC ratios from this dataset help in higher-level hierarchical classification in bacterial systematics. Keywords: ENDMEMO, GC content, Hyperthermophiles, New digital data, Whole genome
url http://www.sciencedirect.com/science/article/pii/S235234091630333X
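
Note on GC content: the description reports %GC values per accession but does not state the formula. As a rough illustration only, GC content is commonly computed as 100 × (G + C) / (A + T + G + C). The Python sketch below assumes that definition and assumes ambiguous bases such as N are excluded from the denominator; how ENDMEMO GC treats such bases is not stated in this record. The function names `gc_content` and `read_fasta` and the toy sequences are hypothetical and are not taken from the article or its dataset.

```python
def gc_content(sequence: str) -> float:
    """Return %GC over unambiguous bases (A, C, G, T), case-insensitive.

    Assumption: ambiguous symbols (e.g. N) are ignored entirely.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * gc / total


def read_fasta(text: str) -> dict[str, str]:
    """Parse FASTA-formatted text into {record id: sequence}."""
    chunks: dict[str, list[str]] = {}
    name = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Record id is the first whitespace-delimited token of the header.
            name = line[1:].split()[0]
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


# Toy input with made-up ids and sequences, not real Thermotoga data.
toy_fasta = """>toy_1 example record
GGCCAATT
>toy_2 example record
ggccNN
GGCC
"""

for record_id, seq in read_fasta(toy_fasta).items():
    print(record_id, round(gc_content(seq), 1))
```

For a real genome, the same function could be applied to the sequence of a FASTA file downloaded for one of the accessions listed in the description. The result may differ slightly from a tool that counts ambiguous bases in the denominator.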