A high level interface to SCOP and ASTRAL implemented in Python

<p>Abstract</p> <p>Background</p> <p>Benchmarking algorithms in structural bioinformatics often involves the construction of datasets of proteins with given sequence and structural properties. The SCOP database is a manually curated structural classification which group...

Full description

Bibliographic Details
Main Authors: Saqi Mansoor AS, Crooks Gavin E, Casbon James A
Format: Article
Language:English
Published: BMC 2006-01-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/7/10
Description
Summary:<p>Abstract</p> <p>Background</p> <p>Benchmarking algorithms in structural bioinformatics often involves the construction of datasets of proteins with given sequence and structural properties. The SCOP database is a manually curated structural classification which groups together proteins on the basis of structural similarity. The ASTRAL compendium provides non redundant subsets of SCOP domains on the basis of sequence similarity such that no two domains in a given subset share more than a defined degree of sequence similarity. Taken together these two resources provide a 'ground truth' for assessing structural bioinformatics algorithms. We present a small and easy to use API written in python to enable construction of datasets from these resources.</p> <p>Results</p> <p>We have designed a set of python modules to provide an abstraction of the SCOP and ASTRAL databases. The modules are designed to work as part of the Biopython distribution. Python users can now manipulate and use the SCOP hierarchy from within python programs, and use ASTRAL to return sequences of domains in SCOP, as well as clustered representations of SCOP from ASTRAL.</p> <p>Conclusion</p> <p>The modules make the analysis and generation of datasets for use in structural genomics easier and more principled.</p>
ISSN:1471-2105