Complete Mitochondrial Genomes of the Cherskii’s Sculpin and Siberian Taimen Reveal GenBank Entry Errors: Incorrect Species Identification and Recombinant Mitochondrial Genome
The complete mitochondrial (mt) genome is sequenced in 2 individuals of the Cherskii’s sculpin Cottus czerskii . A surprisingly high level of sequence divergence (10.3%) has been detected between the 2 genomes of C czerskii studied here and the GenBank mt genome of C czerskii (KJ956027). At the same...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2017-08-01
|
Series: | Evolutionary Bioinformatics |
Online Access: | https://doi.org/10.1177/1176934317726783 |
id |
doaj-8cc3a5e42d464684850946df857a4344 |
---|---|
record_format |
Article |
spelling |
doaj-8cc3a5e42d464684850946df857a43442020-11-25T04:00:20ZengSAGE PublishingEvolutionary Bioinformatics1176-93432017-08-011310.1177/1176934317726783Complete Mitochondrial Genomes of the Cherskii’s Sculpin and Siberian Taimen Reveal GenBank Entry Errors: Incorrect Species Identification and Recombinant Mitochondrial GenomeEvgeniy S Balakirev0Pavel A Saveliev1Francisco J Ayala2School of Natural Sciences, Far Eastern Federal University, Vladivostok, RussiaA.V. Zhirmunsky Institute of Marine Biology, National Scientific Center of Marine Biology, Far Eastern Branch, Russian Academy of Sciences, Vladivostok, RussiaDepartment of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, CA, USAThe complete mitochondrial (mt) genome is sequenced in 2 individuals of the Cherskii’s sculpin Cottus czerskii . A surprisingly high level of sequence divergence (10.3%) has been detected between the 2 genomes of C czerskii studied here and the GenBank mt genome of C czerskii (KJ956027). At the same time, a surprisingly low level of divergence (1.4%) has been detected between the GenBank C czerskii (KJ956027) and the Amur sculpin Cottus szanaga (KX762049, KX762050). We argue that the observed discrepancies are due to incorrect taxonomic identification so that the GenBank accession number KJ956027 represents actually the mt genome of C szanaga erroneously identified as C czerskii . Our results are of consequence concerning the GenBank database quality, highlighting the potential negative consequences of entry errors, which once they are introduced tend to be propagated among databases and subsequent publications. We illustrate the premise with the data on recombinant mt genome of the Siberian taimen Hucho taimen (NCBI Reference Sequence Database NC_016426.1; GenBank accession number HQ897271.1), bearing 2 introgressed fragments (≈0.9 kb [kilobase]) from 2 lenok subspecies, Brachymystax lenok and Brachymystax lenok tsinlingensis , submitted to GenBank on June 12, 2011. Since the time of submission, the H taimen recombinant mt genome leading to incorrect phylogenetic inferences was propagated in multiple subsequent publications despite the fact that nonrecombinant H taimen genomes were also available (submitted to GenBank on August 2, 2014; KJ711549, KJ711550). Other examples of recombinant sequences persisting in GenBank are also considered. A GenBank Entry Error Depositary is urgently needed to monitor and avoid a progressive accumulation of wrong biological information.https://doi.org/10.1177/1176934317726783 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Evgeniy S Balakirev Pavel A Saveliev Francisco J Ayala |
spellingShingle |
Evgeniy S Balakirev Pavel A Saveliev Francisco J Ayala Complete Mitochondrial Genomes of the Cherskii’s Sculpin and Siberian Taimen Reveal GenBank Entry Errors: Incorrect Species Identification and Recombinant Mitochondrial Genome Evolutionary Bioinformatics |
author_facet |
Evgeniy S Balakirev Pavel A Saveliev Francisco J Ayala |
author_sort |
Evgeniy S Balakirev |
title |
Complete Mitochondrial Genomes of the Cherskii’s Sculpin and Siberian Taimen Reveal GenBank Entry Errors: Incorrect Species Identification and Recombinant Mitochondrial Genome |
title_short |
Complete Mitochondrial Genomes of the Cherskii’s Sculpin and Siberian Taimen Reveal GenBank Entry Errors: Incorrect Species Identification and Recombinant Mitochondrial Genome |
title_full |
Complete Mitochondrial Genomes of the Cherskii’s Sculpin and Siberian Taimen Reveal GenBank Entry Errors: Incorrect Species Identification and Recombinant Mitochondrial Genome |
title_fullStr |
Complete Mitochondrial Genomes of the Cherskii’s Sculpin and Siberian Taimen Reveal GenBank Entry Errors: Incorrect Species Identification and Recombinant Mitochondrial Genome |
title_full_unstemmed |
Complete Mitochondrial Genomes of the Cherskii’s Sculpin and Siberian Taimen Reveal GenBank Entry Errors: Incorrect Species Identification and Recombinant Mitochondrial Genome |
title_sort |
complete mitochondrial genomes of the cherskii’s sculpin and siberian taimen reveal genbank entry errors: incorrect species identification and recombinant mitochondrial genome |
publisher |
SAGE Publishing |
series |
Evolutionary Bioinformatics |
issn |
1176-9343 |
publishDate |
2017-08-01 |
description |
The complete mitochondrial (mt) genome is sequenced in 2 individuals of the Cherskii’s sculpin Cottus czerskii . A surprisingly high level of sequence divergence (10.3%) has been detected between the 2 genomes of C czerskii studied here and the GenBank mt genome of C czerskii (KJ956027). At the same time, a surprisingly low level of divergence (1.4%) has been detected between the GenBank C czerskii (KJ956027) and the Amur sculpin Cottus szanaga (KX762049, KX762050). We argue that the observed discrepancies are due to incorrect taxonomic identification so that the GenBank accession number KJ956027 represents actually the mt genome of C szanaga erroneously identified as C czerskii . Our results are of consequence concerning the GenBank database quality, highlighting the potential negative consequences of entry errors, which once they are introduced tend to be propagated among databases and subsequent publications. We illustrate the premise with the data on recombinant mt genome of the Siberian taimen Hucho taimen (NCBI Reference Sequence Database NC_016426.1; GenBank accession number HQ897271.1), bearing 2 introgressed fragments (≈0.9 kb [kilobase]) from 2 lenok subspecies, Brachymystax lenok and Brachymystax lenok tsinlingensis , submitted to GenBank on June 12, 2011. Since the time of submission, the H taimen recombinant mt genome leading to incorrect phylogenetic inferences was propagated in multiple subsequent publications despite the fact that nonrecombinant H taimen genomes were also available (submitted to GenBank on August 2, 2014; KJ711549, KJ711550). Other examples of recombinant sequences persisting in GenBank are also considered. A GenBank Entry Error Depositary is urgently needed to monitor and avoid a progressive accumulation of wrong biological information. |
url |
https://doi.org/10.1177/1176934317726783 |
work_keys_str_mv |
AT evgeniysbalakirev completemitochondrialgenomesofthecherskiissculpinandsiberiantaimenrevealgenbankentryerrorsincorrectspeciesidentificationandrecombinantmitochondrialgenome AT pavelasaveliev completemitochondrialgenomesofthecherskiissculpinandsiberiantaimenrevealgenbankentryerrorsincorrectspeciesidentificationandrecombinantmitochondrialgenome AT franciscojayala completemitochondrialgenomesofthecherskiissculpinandsiberiantaimenrevealgenbankentryerrorsincorrectspeciesidentificationandrecombinantmitochondrialgenome |
_version_ |
1724451266247000064 |