A population study of the minicircles in <it>Trypanosoma cruzi</it>: predicting guide RNAs in the absence of empirical RNA editing

<p>Abstract</p> <p>Background</p> <p>The structurally complex network of minicircles and maxicircles comprising the mitochondrial DNA of kinetoplastids mirrors the complexity of the RNA editing process that is required for faithful expression of encrypted maxicircle gen...

Full description

Bibliographic Details
Main Authors: Westenberger Scott J, Martinez LL Isadora, Thomas Sean, Sturm Nancy R
Format: Article
Language:English
Published: BMC 2007-05-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/8/133
Description
Summary:<p>Abstract</p> <p>Background</p> <p>The structurally complex network of minicircles and maxicircles comprising the mitochondrial DNA of kinetoplastids mirrors the complexity of the RNA editing process that is required for faithful expression of encrypted maxicircle genes. Although a few of the guide RNAs that direct this editing process have been discovered on maxicircles, guide RNAs are mostly found on the minicircles. The nuclear and maxicircle genomes have been sequenced and assembled for <it>Trypanosoma cruzi</it>, the causative agent of Chagas disease, however the complement of 1.4-kb minicircles, carrying four guide RNA genes per molecule in this parasite, has been less thoroughly characterised.</p> <p>Results</p> <p>Fifty-four CL Brener and 53 Esmeraldo strain minicircle sequence reads were extracted from <it>T. cruzi </it>whole genome shotgun sequencing data. With these sequences and all published <it>T. cruzi </it>minicircle sequences, 108 unique guide RNAs from all known <it>T. cruzi </it>minicircle sequences and two guide RNAs from the CL Brener maxicircle were predicted using a local alignment algorithm and mapped onto predicted or experimentally determined sequences of edited maxicircle open reading frames. For half of the sequences no statistically significant guide RNA could be assigned. Likely positions of these unidentified gRNAs in <it>T. cruzi </it>minicircle sequences are estimated using a simple Hidden Markov Model. With the local alignment predictions as a standard, the HMM had an ~85% chance of correctly identifying at least 20 nucleotides of guide RNA from a given minicircle sequence. Inter-minicircle recombination was documented. Variable regions contain species-specific areas of distinct nucleotide preference. Two maxicircle guide RNA genes were found.</p> <p>Conclusion</p> <p>The identification of new minicircle sequences and the further characterization of all published minicircles are presented, including the first observation of recombination between minicircles. Extrapolation suggests a level of 4% recombinants in the population, supporting a relatively high recombination rate that may serve to minimize the persistence of gRNA pseudogenes. Characteristic nucleotide preferences observed within variable regions provide potential clues regarding the transcription and maturation of <it>T. cruzi </it>guide RNAs. Based on these preferences, a method of predicting <it>T. cruzi </it>guide RNAs using only primary minicircle sequence data was created.</p>
ISSN:1471-2164