Domain fusion analysis by applying relational algebra to protein sequence and domain databases
<p>Abstract</p> <p>Background</p> <p>Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PR...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2003-05-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/4/16 |
id |
doaj-75d61a99436843b0a2d56a1d00b2101e |
---|---|
record_format |
Article |
spelling |
doaj-75d61a99436843b0a2d56a1d00b2101e2020-11-25T01:47:05ZengBMCBMC Bioinformatics1471-21052003-05-01411610.1186/1471-2105-4-16Domain fusion analysis by applying relational algebra to protein sequence and domain databasesIkura MitsuhikoTruong Kevin<p>Abstract</p> <p>Background</p> <p>Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful.</p> <p>Results</p> <p>This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in <it>H. sapiens </it>and <it>S. cerevisiae</it>, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at <url>http://calcium.uhnres.utoronto.ca/pi</url>.</p> <p>Conclusion</p> <p>As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time.</p> http://www.biomedcentral.com/1471-2105/4/16 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ikura Mitsuhiko Truong Kevin |
spellingShingle |
Ikura Mitsuhiko Truong Kevin Domain fusion analysis by applying relational algebra to protein sequence and domain databases BMC Bioinformatics |
author_facet |
Ikura Mitsuhiko Truong Kevin |
author_sort |
Ikura Mitsuhiko |
title |
Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_short |
Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_full |
Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_fullStr |
Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_full_unstemmed |
Domain fusion analysis by applying relational algebra to protein sequence and domain databases |
title_sort |
domain fusion analysis by applying relational algebra to protein sequence and domain databases |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2003-05-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful.</p> <p>Results</p> <p>This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in <it>H. sapiens </it>and <it>S. cerevisiae</it>, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at <url>http://calcium.uhnres.utoronto.ca/pi</url>.</p> <p>Conclusion</p> <p>As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time.</p> |
url |
http://www.biomedcentral.com/1471-2105/4/16 |
work_keys_str_mv |
AT ikuramitsuhiko domainfusionanalysisbyapplyingrelationalalgebratoproteinsequenceanddomaindatabases AT truongkevin domainfusionanalysisbyapplyingrelationalalgebratoproteinsequenceanddomaindatabases |
_version_ |
1725016479747801088 |