Identification of proprotein convertase substrates using genome-wide expression correlation analysis

<p>Abstract</p> <p>Background</p> <p>Subtilisin/kexin-like proprotein convertase (PCSK) enzymes have important regulatory function in a wide variety of biological processes. PCSKs proteolytically process at a target sequence that contains basic amino acids arginine and...

Full description

Bibliographic Details
Main Authors: Turpeinen Hannu, Kukkurainen Sampo, Pulkkinen Kati, Kauppila Timo, Ojala Kalle, Hytönen Vesa P, Pesu Marko
Format: Article
Language:English
Published: BMC 2011-12-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/12/618
id doaj-be77a485522146a0836e04bc8055c68f
record_format Article
spelling doaj-be77a485522146a0836e04bc8055c68f2020-11-24T20:51:35ZengBMCBMC Genomics1471-21642011-12-0112161810.1186/1471-2164-12-618Identification of proprotein convertase substrates using genome-wide expression correlation analysisTurpeinen HannuKukkurainen SampoPulkkinen KatiKauppila TimoOjala KalleHytönen Vesa PPesu Marko<p>Abstract</p> <p>Background</p> <p>Subtilisin/kexin-like proprotein convertase (PCSK) enzymes have important regulatory function in a wide variety of biological processes. PCSKs proteolytically process at a target sequence that contains basic amino acids arginine and lysine, which results in functional maturation of the target protein. <it>In vitro </it>assays have showed significant biochemical redundancy between the seven family members, but the phenotypes of PCSK deficient mice and patients carrying an inactive PCSK allele argue for a specific biological function. Modeling the structures of individual PCSK enzymes has offered little insights into the specificity determinants. However, previous studies have shown that there can be a coordinated expression between a PCSK and its target molecule. Here, we have surveyed the putative PCSK target proteins using genome-wide expression correlation analysis and cleavage site prediction algorithms.</p> <p>Results</p> <p>We first performed a gene expression correlation analysis over the whole genome for all PCSK enzymes. PCSKs were found to cluster differently based on the strength of correlations. The screen for putative PCSK target proteins showed a significant enrichment (p-values from 1.2e-4 to < 1.0e-10) of putative targets among the most positively correlating genes for most PCSKs. Interestingly, there was no enrichment in putative targets among the genes that correlated positively with the biologically redundant PCSK7, whereas PCSK5 showed an inverse correlation. PCSKs also showed a highly variable degree of shared target genes that were identified by expression correlation and cleavage site prediction. Multiple alignments were used to evaluate the putative targets to pinpoint the important residues for the substrate recognition. Finally, we validated our approach and identified biochemically PAPPA1 and ADAMTS6 as novel targets for FURIN proteolytic activity.</p> <p>Conclusions</p> <p>Most PCSK enzymes display strong positive expression correlation with predicted target proteins in our genome-wide analysis. We also show that expression correlation screen combined with a cleavage site-prediction analysis can be used to identify novel bona fide target molecules for PCSKs. Exploring the positively correlating genes can thus offer additional insights into the biology of proprotein convertases.</p> http://www.biomedcentral.com/1471-2164/12/618
collection DOAJ
language English
format Article
sources DOAJ
author Turpeinen Hannu
Kukkurainen Sampo
Pulkkinen Kati
Kauppila Timo
Ojala Kalle
Hytönen Vesa P
Pesu Marko
spellingShingle Turpeinen Hannu
Kukkurainen Sampo
Pulkkinen Kati
Kauppila Timo
Ojala Kalle
Hytönen Vesa P
Pesu Marko
Identification of proprotein convertase substrates using genome-wide expression correlation analysis
BMC Genomics
author_facet Turpeinen Hannu
Kukkurainen Sampo
Pulkkinen Kati
Kauppila Timo
Ojala Kalle
Hytönen Vesa P
Pesu Marko
author_sort Turpeinen Hannu
title Identification of proprotein convertase substrates using genome-wide expression correlation analysis
title_short Identification of proprotein convertase substrates using genome-wide expression correlation analysis
title_full Identification of proprotein convertase substrates using genome-wide expression correlation analysis
title_fullStr Identification of proprotein convertase substrates using genome-wide expression correlation analysis
title_full_unstemmed Identification of proprotein convertase substrates using genome-wide expression correlation analysis
title_sort identification of proprotein convertase substrates using genome-wide expression correlation analysis
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2011-12-01
description <p>Abstract</p> <p>Background</p> <p>Subtilisin/kexin-like proprotein convertase (PCSK) enzymes have important regulatory function in a wide variety of biological processes. PCSKs proteolytically process at a target sequence that contains basic amino acids arginine and lysine, which results in functional maturation of the target protein. <it>In vitro </it>assays have showed significant biochemical redundancy between the seven family members, but the phenotypes of PCSK deficient mice and patients carrying an inactive PCSK allele argue for a specific biological function. Modeling the structures of individual PCSK enzymes has offered little insights into the specificity determinants. However, previous studies have shown that there can be a coordinated expression between a PCSK and its target molecule. Here, we have surveyed the putative PCSK target proteins using genome-wide expression correlation analysis and cleavage site prediction algorithms.</p> <p>Results</p> <p>We first performed a gene expression correlation analysis over the whole genome for all PCSK enzymes. PCSKs were found to cluster differently based on the strength of correlations. The screen for putative PCSK target proteins showed a significant enrichment (p-values from 1.2e-4 to < 1.0e-10) of putative targets among the most positively correlating genes for most PCSKs. Interestingly, there was no enrichment in putative targets among the genes that correlated positively with the biologically redundant PCSK7, whereas PCSK5 showed an inverse correlation. PCSKs also showed a highly variable degree of shared target genes that were identified by expression correlation and cleavage site prediction. Multiple alignments were used to evaluate the putative targets to pinpoint the important residues for the substrate recognition. Finally, we validated our approach and identified biochemically PAPPA1 and ADAMTS6 as novel targets for FURIN proteolytic activity.</p> <p>Conclusions</p> <p>Most PCSK enzymes display strong positive expression correlation with predicted target proteins in our genome-wide analysis. We also show that expression correlation screen combined with a cleavage site-prediction analysis can be used to identify novel bona fide target molecules for PCSKs. Exploring the positively correlating genes can thus offer additional insights into the biology of proprotein convertases.</p>
url http://www.biomedcentral.com/1471-2164/12/618
work_keys_str_mv AT turpeinenhannu identificationofproproteinconvertasesubstratesusinggenomewideexpressioncorrelationanalysis
AT kukkurainensampo identificationofproproteinconvertasesubstratesusinggenomewideexpressioncorrelationanalysis
AT pulkkinenkati identificationofproproteinconvertasesubstratesusinggenomewideexpressioncorrelationanalysis
AT kauppilatimo identificationofproproteinconvertasesubstratesusinggenomewideexpressioncorrelationanalysis
AT ojalakalle identificationofproproteinconvertasesubstratesusinggenomewideexpressioncorrelationanalysis
AT hytonenvesap identificationofproproteinconvertasesubstratesusinggenomewideexpressioncorrelationanalysis
AT pesumarko identificationofproproteinconvertasesubstratesusinggenomewideexpressioncorrelationanalysis
_version_ 1716801714922717184