High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>Mycobacterium tuberculosis </it>as an example

<p>Abstract</p> <p>Background</p> <p>While the genomic annotations of diverse lineages of the <it>Mycobacterium tuberculosis </it>complex are available, divergences between gene prediction methods are still a challenge for unbiased protein dataset generation...

Full description

Bibliographic Details
Main Authors: Prasad Swati, Sælensminde Gisle, Søfteland Tina, Målen Hiwa, de Souza Gustavo A, Jonassen Inge, Wiker Harald G
Format: Article
Language:English
Published: BMC 2008-07-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/9/316
id doaj-1a4865aee20049ed95032b71dae83676
record_format Article
spelling doaj-1a4865aee20049ed95032b71dae836762020-11-25T01:04:44ZengBMCBMC Genomics1471-21642008-07-019131610.1186/1471-2164-9-316High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>Mycobacterium tuberculosis </it>as an examplePrasad SwatiSælensminde GisleSøfteland TinaMålen Hiwade Souza Gustavo AJonassen IngeWiker Harald G<p>Abstract</p> <p>Background</p> <p>While the genomic annotations of diverse lineages of the <it>Mycobacterium tuberculosis </it>complex are available, divergences between gene prediction methods are still a challenge for unbiased protein dataset generation. <it>M. tuberculosis </it>gene annotation is an example, where the most used datasets from two independent institutions (Sanger Institute and Institute of Genomic Research-TIGR) differ up to 12% in the number of annotated open reading frames, and 46% of the genes contained in both annotations have different start codons. Such differences emphasize the importance of the identification of the sequence of protein products to validate each gene annotation including its sequence coding area.</p> <p>Results</p> <p>With this objective, we submitted a culture filtrate sample from <it>M. tuberculosis </it>to a high-accuracy LTQ-Orbitrap mass spectrometer analysis and applied refined N-terminal prediction to perform comparison of two gene annotations. From a total of 449 proteins identified from the MS data, we validated 35 tryptic peptides that were specific to one of the two datasets, representing 24 different proteins. From those, 5 proteins were only annotated in the Sanger database. In the remaining proteins, the observed differences were due to differences in annotation of transcriptional start sites.</p> <p>Conclusion</p> <p>Our results indicate that, even in a less complex sample likely to represent only 10% of the bacterial proteome, we were still able to detect major differences between different gene annotation approaches. This gives hope that high-throughput proteomics techniques can be used to improve and validate gene annotations, and in particular for verification of high-throughput, automatic gene annotations.</p> http://www.biomedcentral.com/1471-2164/9/316
collection DOAJ
language English
format Article
sources DOAJ
author Prasad Swati
Sælensminde Gisle
Søfteland Tina
Målen Hiwa
de Souza Gustavo A
Jonassen Inge
Wiker Harald G
spellingShingle Prasad Swati
Sælensminde Gisle
Søfteland Tina
Målen Hiwa
de Souza Gustavo A
Jonassen Inge
Wiker Harald G
High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>Mycobacterium tuberculosis </it>as an example
BMC Genomics
author_facet Prasad Swati
Sælensminde Gisle
Søfteland Tina
Målen Hiwa
de Souza Gustavo A
Jonassen Inge
Wiker Harald G
author_sort Prasad Swati
title High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>Mycobacterium tuberculosis </it>as an example
title_short High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>Mycobacterium tuberculosis </it>as an example
title_full High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>Mycobacterium tuberculosis </it>as an example
title_fullStr High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>Mycobacterium tuberculosis </it>as an example
title_full_unstemmed High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>Mycobacterium tuberculosis </it>as an example
title_sort high accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using <it>mycobacterium tuberculosis </it>as an example
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2008-07-01
description <p>Abstract</p> <p>Background</p> <p>While the genomic annotations of diverse lineages of the <it>Mycobacterium tuberculosis </it>complex are available, divergences between gene prediction methods are still a challenge for unbiased protein dataset generation. <it>M. tuberculosis </it>gene annotation is an example, where the most used datasets from two independent institutions (Sanger Institute and Institute of Genomic Research-TIGR) differ up to 12% in the number of annotated open reading frames, and 46% of the genes contained in both annotations have different start codons. Such differences emphasize the importance of the identification of the sequence of protein products to validate each gene annotation including its sequence coding area.</p> <p>Results</p> <p>With this objective, we submitted a culture filtrate sample from <it>M. tuberculosis </it>to a high-accuracy LTQ-Orbitrap mass spectrometer analysis and applied refined N-terminal prediction to perform comparison of two gene annotations. From a total of 449 proteins identified from the MS data, we validated 35 tryptic peptides that were specific to one of the two datasets, representing 24 different proteins. From those, 5 proteins were only annotated in the Sanger database. In the remaining proteins, the observed differences were due to differences in annotation of transcriptional start sites.</p> <p>Conclusion</p> <p>Our results indicate that, even in a less complex sample likely to represent only 10% of the bacterial proteome, we were still able to detect major differences between different gene annotation approaches. This gives hope that high-throughput proteomics techniques can be used to improve and validate gene annotations, and in particular for verification of high-throughput, automatic gene annotations.</p>
url http://www.biomedcentral.com/1471-2164/9/316
work_keys_str_mv AT prasadswati highaccuracymassspectrometryanalysisasatooltoverifyandimprovegeneannotationusingitmycobacteriumtuberculosisitasanexample
AT sælensmindegisle highaccuracymassspectrometryanalysisasatooltoverifyandimprovegeneannotationusingitmycobacteriumtuberculosisitasanexample
AT søftelandtina highaccuracymassspectrometryanalysisasatooltoverifyandimprovegeneannotationusingitmycobacteriumtuberculosisitasanexample
AT malenhiwa highaccuracymassspectrometryanalysisasatooltoverifyandimprovegeneannotationusingitmycobacteriumtuberculosisitasanexample
AT desouzagustavoa highaccuracymassspectrometryanalysisasatooltoverifyandimprovegeneannotationusingitmycobacteriumtuberculosisitasanexample
AT jonasseninge highaccuracymassspectrometryanalysisasatooltoverifyandimprovegeneannotationusingitmycobacteriumtuberculosisitasanexample
AT wikerharaldg highaccuracymassspectrometryanalysisasatooltoverifyandimprovegeneannotationusingitmycobacteriumtuberculosisitasanexample
_version_ 1725196349484302336