Structure-aware mycobacterium tuberculosis functional annotation uncloaks resistance, metabolic, and virulence genes

Accurate and timely functional genome annotation is essential for translating basic pathogen research into clinically impactful advances. Here, through literature curation and structure-function inference, we systematically update the functional genome annotation of Mycobacterium tuberculosis virule...

Full description

Bibliographic Details
Main Authors: Baughn, A.D (Author), Chan, C.K (Author), Dhillon, N. (Author), Dillon, N.A (Author), Elghraoui, A. (Author), Gunasekaran, D. (Author), Kuo, N. (Author), Modlin, S.J (Author), Robinhold, C. (Author), Valafar, F. (Author), Zlotnicki, A.M (Author)
Format: Article
Language:English
Published: American Society for Microbiology 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 03389nam a2200529Ia 4500
001 10.1128-mSystems.00673-21
008 220427s2021 CNT 000 0 und d
020 |a 23795077 (ISSN) 
245 1 0 |a Structure-aware mycobacterium tuberculosis functional annotation uncloaks resistance, metabolic, and virulence genes 
260 0 |b American Society for Microbiology  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1128/mSystems.00673-21 
520 3 |a Accurate and timely functional genome annotation is essential for translating basic pathogen research into clinically impactful advances. Here, through literature curation and structure-function inference, we systematically update the functional genome annotation of Mycobacterium tuberculosis virulent type strain H37Rv. First, we systematically curated annotations for 589 genes from 662 publications, including 282 gene products absent from leading databases. Second, we modeled 1,711 underannotated proteins and developed a semiautomated pipeline that captured shared function between 400 protein models and structural matches of known function on Protein Data Bank, including drug efflux proteins, metabolic enzymes, and virulence factors. In aggregate, these structure- and literature-derived annotations update 940/1,725 underannotated H37Rv genes and generate hundreds of functional hypotheses. Retrospectively applying the annotation to a recent whole-genome transposon mutant screen provided missing function for 48% (13/27) of underannotated genes altering antibiotic efficacy and 33% (23/69) required for persistence during mouse tuberculosis (TB) infection. Prospective application of the protein models enabled us to functionally interpret novel laboratory generated pyrazinamide (PZA)-resistant mutants of unknown function, which implicated the emerging coenzyme A depletion model of PZA action in the mutants’ PZA resistance. Our findings demonstrate the functional insight gained by integrating structural modeling and systematic literature curation, even for widely studied microorganisms. Functional annotations and protein structure models are available at https://tuberculosis.sdsu.edu/H37Rv in human- and machine-readable formats. Copyright © 2021 Modlin et al. 
650 0 4 |a antibiotic agent 
650 0 4 |a antibiotic resistance 
650 0 4 |a Article 
650 0 4 |a bacterial enzyme 
650 0 4 |a bacterial gene 
650 0 4 |a bacterial genome 
650 0 4 |a bacterial protein 
650 0 4 |a bacterial virulence 
650 0 4 |a bacterium mutant 
650 0 4 |a carrier protein 
650 0 4 |a coenzyme A 
650 0 4 |a controlled study 
650 0 4 |a drug efficacy 
650 0 4 |a drug metabolism 
650 0 4 |a Mycobacterium tuberculosis 
650 0 4 |a nonhuman 
650 0 4 |a Protein Data Bank 
650 0 4 |a protein structure 
650 0 4 |a pyrazinamide 
650 0 4 |a transposon 
650 0 4 |a tuberculosis 
650 0 4 |a type strain 
650 0 4 |a virulence factor 
700 1 |a Baughn, A.D.  |e author 
700 1 |a Chan, C.K.  |e author 
700 1 |a Dhillon, N.  |e author 
700 1 |a Dillon, N.A.  |e author 
700 1 |a Elghraoui, A.  |e author 
700 1 |a Gunasekaran, D.  |e author 
700 1 |a Kuo, N.  |e author 
700 1 |a Modlin, S.J.  |e author 
700 1 |a Robinhold, C.  |e author 
700 1 |a Valafar, F.  |e author 
700 1 |a Zlotnicki, A.M.  |e author 
773 |t mSystems