Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays

<p>Abstract</p> <p>Background</p> <p>Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation.</p> <p>Results</p> <p>A method and SPR Opt (<...

Full description

Bibliographic Details
Main Authors: Wagner Mark C, Gardner Shea N
Format: Article
Language:English
Published: BMC 2005-05-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/6/73
id doaj-ae30b1295ced4eb2b8ab1a1746433325
record_format Article
spelling doaj-ae30b1295ced4eb2b8ab1a17464333252020-11-24T22:00:26ZengBMCBMC Genomics1471-21642005-05-01617310.1186/1471-2164-6-73Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assaysWagner Mark CGardner Shea N<p>Abstract</p> <p>Background</p> <p>Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation.</p> <p>Results</p> <p>A method and SPR Opt (<b>S</b>NP and <b>P</b>CR-<b>R</b>FLP <b>Opt</b>imization) software to perform a comprehensive, whole-genome analysis to forensically discriminate multiple sequences is presented. Tools for the optimization of forensic typing using Single Nucleotide Polymorphism (SNP) and PCR-Restriction Fragment Length Polymorphism (PCR-RFLP) analyses across multiple isolate sequences of a species are described. The PCR-RFLP analysis includes prediction and selection of optimal primers and restriction enzymes to enable maximum isolate discrimination based on sequence information. SPR Opt calculates all SNP or PCR-RFLP variations present in the sequences, groups them into haplotypes according to their co-segregation across those sequences, and performs combinatoric analyses to determine which sets of haplotypes provide maximal discrimination among all the input sequences. Those set combinations requiring that membership in the fewest haplotypes be queried (i.e. the fewest assays be performed) are found. These analyses highlight variable regions based on existing sequence data. These markers may be heterogeneous among unsequenced isolates as well, and thus may be useful for characterizing the relationships among unsequenced as well as sequenced isolates. The predictions are multi-locus. Analyses of mumps and SARS viruses are summarized. Phylogenetic trees created based on SNPs, PCR-RFLPs, and full genomes are compared for SARS virus, illustrating that purported phylogenies based only on SNP or PCR-RFLP variations do not match those based on multiple sequence alignment of the full genomes.</p> <p>Conclusion</p> <p>This is the first software to optimize the selection of forensic markers to maximize information gained from the fewest assays, accepting whole or partial genome sequence data as input. As more sequence data becomes available for multiple strains and isolates of a species, automated, computational approaches such as those described here will be essential to make sense of large amounts of information, and to guide and optimize efforts in the laboratory. The software and source code for SPR Opt is publicly available and free for non-profit use at <url>http://www.llnl.gov/IPandC/technology/software/softwaretitles/spropt.php</url>.</p> http://www.biomedcentral.com/1471-2164/6/73
collection DOAJ
language English
format Article
sources DOAJ
author Wagner Mark C
Gardner Shea N
spellingShingle Wagner Mark C
Gardner Shea N
Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays
BMC Genomics
author_facet Wagner Mark C
Gardner Shea N
author_sort Wagner Mark C
title Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays
title_short Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays
title_full Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays
title_fullStr Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays
title_full_unstemmed Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays
title_sort software for optimization of snp and pcr-rflp genotyping to discriminate many genomes with the fewest assays
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2005-05-01
description <p>Abstract</p> <p>Background</p> <p>Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation.</p> <p>Results</p> <p>A method and SPR Opt (<b>S</b>NP and <b>P</b>CR-<b>R</b>FLP <b>Opt</b>imization) software to perform a comprehensive, whole-genome analysis to forensically discriminate multiple sequences is presented. Tools for the optimization of forensic typing using Single Nucleotide Polymorphism (SNP) and PCR-Restriction Fragment Length Polymorphism (PCR-RFLP) analyses across multiple isolate sequences of a species are described. The PCR-RFLP analysis includes prediction and selection of optimal primers and restriction enzymes to enable maximum isolate discrimination based on sequence information. SPR Opt calculates all SNP or PCR-RFLP variations present in the sequences, groups them into haplotypes according to their co-segregation across those sequences, and performs combinatoric analyses to determine which sets of haplotypes provide maximal discrimination among all the input sequences. Those set combinations requiring that membership in the fewest haplotypes be queried (i.e. the fewest assays be performed) are found. These analyses highlight variable regions based on existing sequence data. These markers may be heterogeneous among unsequenced isolates as well, and thus may be useful for characterizing the relationships among unsequenced as well as sequenced isolates. The predictions are multi-locus. Analyses of mumps and SARS viruses are summarized. Phylogenetic trees created based on SNPs, PCR-RFLPs, and full genomes are compared for SARS virus, illustrating that purported phylogenies based only on SNP or PCR-RFLP variations do not match those based on multiple sequence alignment of the full genomes.</p> <p>Conclusion</p> <p>This is the first software to optimize the selection of forensic markers to maximize information gained from the fewest assays, accepting whole or partial genome sequence data as input. As more sequence data becomes available for multiple strains and isolates of a species, automated, computational approaches such as those described here will be essential to make sense of large amounts of information, and to guide and optimize efforts in the laboratory. The software and source code for SPR Opt is publicly available and free for non-profit use at <url>http://www.llnl.gov/IPandC/technology/software/softwaretitles/spropt.php</url>.</p>
url http://www.biomedcentral.com/1471-2164/6/73
work_keys_str_mv AT wagnermarkc softwareforoptimizationofsnpandpcrrflpgenotypingtodiscriminatemanygenomeswiththefewestassays
AT gardnershean softwareforoptimizationofsnpandpcrrflpgenotypingtodiscriminatemanygenomeswiththefewestassays
_version_ 1725844456099282944