Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.

Position-specific scoring matrices (PSSMs) are useful for detecting weak homology in protein sequence analysis, and they are thought to contain some essential signatures of the protein families. In order to elucidate what kind of ingredients constitute such family-specific signatures, we apply singu...

Full description

Bibliographic Details
Main Authors: Akira R Kinjo, Haruki Nakamura
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2008-04-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC2276316?pdf=render
id doaj-05688ccb4a4d40c79b191dcebcba9b03
record_format Article
spelling doaj-05688ccb4a4d40c79b191dcebcba9b032020-11-25T00:27:01ZengPublic Library of Science (PLoS)PLoS ONE1932-62032008-04-0134e196310.1371/journal.pone.0001963Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.Akira R KinjoHaruki NakamuraPosition-specific scoring matrices (PSSMs) are useful for detecting weak homology in protein sequence analysis, and they are thought to contain some essential signatures of the protein families. In order to elucidate what kind of ingredients constitute such family-specific signatures, we apply singular value decomposition to a set of PSSMs and examine the properties of dominant right and left singular vectors. The first right singular vectors were correlated with various amino acid indices including relative mutability, amino acid composition in protein interior, hydropathy, or turn propensity, depending on proteins. A significant correlation between the first left singular vector and a measure of site conservation was observed. It is shown that the contribution of the first singular component to the PSSMs act to disfavor potentially but falsely functionally important residues at conserved sites. The second right singular vectors were highly correlated with hydrophobicity scales, and the corresponding left singular vectors with contact numbers of protein structures. It is suggested that sequence alignment with a PSSM is essentially equivalent to threading supplemented with functional information. In addition, singular vectors may be useful for analyzing and annotating the characteristics of conserved sites in protein families.http://europepmc.org/articles/PMC2276316?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Akira R Kinjo
Haruki Nakamura
spellingShingle Akira R Kinjo
Haruki Nakamura
Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.
PLoS ONE
author_facet Akira R Kinjo
Haruki Nakamura
author_sort Akira R Kinjo
title Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.
title_short Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.
title_full Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.
title_fullStr Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.
title_full_unstemmed Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.
title_sort nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2008-04-01
description Position-specific scoring matrices (PSSMs) are useful for detecting weak homology in protein sequence analysis, and they are thought to contain some essential signatures of the protein families. In order to elucidate what kind of ingredients constitute such family-specific signatures, we apply singular value decomposition to a set of PSSMs and examine the properties of dominant right and left singular vectors. The first right singular vectors were correlated with various amino acid indices including relative mutability, amino acid composition in protein interior, hydropathy, or turn propensity, depending on proteins. A significant correlation between the first left singular vector and a measure of site conservation was observed. It is shown that the contribution of the first singular component to the PSSMs act to disfavor potentially but falsely functionally important residues at conserved sites. The second right singular vectors were highly correlated with hydrophobicity scales, and the corresponding left singular vectors with contact numbers of protein structures. It is suggested that sequence alignment with a PSSM is essentially equivalent to threading supplemented with functional information. In addition, singular vectors may be useful for analyzing and annotating the characteristics of conserved sites in protein families.
url http://europepmc.org/articles/PMC2276316?pdf=render
work_keys_str_mv AT akirarkinjo natureofproteinfamilysignaturesinsightsfromsingularvalueanalysisofpositionspecificscoringmatrices
AT harukinakamura natureofproteinfamilysignaturesinsightsfromsingularvalueanalysisofpositionspecificscoringmatrices
_version_ 1725341498892877824