Improved contact predictions using the recognition of protein like contact patterns.

Given sufficient large protein families, and using a global statistical inference approach, it is possible to obtain sufficient accuracy in protein residue contact predictions to predict the structure of many proteins. However, these approaches do not consider the fact that the contacts in a protein...

Full description

Bibliographic Details
Main Authors: Marcin J Skwark, Daniele Raimondi, Mirco Michel, Arne Elofsson
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-11-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC4222596?pdf=render
id doaj-bf3ee7bc23c24adf947d0a973eb0912b
record_format Article
spelling doaj-bf3ee7bc23c24adf947d0a973eb0912b2020-11-24T21:51:04ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582014-11-011011e100388910.1371/journal.pcbi.1003889Improved contact predictions using the recognition of protein like contact patterns.Marcin J SkwarkDaniele RaimondiMirco MichelArne ElofssonGiven sufficient large protein families, and using a global statistical inference approach, it is possible to obtain sufficient accuracy in protein residue contact predictions to predict the structure of many proteins. However, these approaches do not consider the fact that the contacts in a protein are neither randomly, nor independently distributed, but actually follow precise rules governed by the structure of the protein and thus are interdependent. Here, we present PconsC2, a novel method that uses a deep learning approach to identify protein-like contact patterns to improve contact predictions. A substantial enhancement can be seen for all contacts independently on the number of aligned sequences, residue separation or secondary structure type, but is largest for β-sheet containing proteins. In addition to being superior to earlier methods based on statistical inferences, in comparison to state of the art methods using machine learning, PconsC2 is superior for families with more than 100 effective sequence homologs. The improved contact prediction enables improved structure prediction.http://europepmc.org/articles/PMC4222596?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Marcin J Skwark
Daniele Raimondi
Mirco Michel
Arne Elofsson
spellingShingle Marcin J Skwark
Daniele Raimondi
Mirco Michel
Arne Elofsson
Improved contact predictions using the recognition of protein like contact patterns.
PLoS Computational Biology
author_facet Marcin J Skwark
Daniele Raimondi
Mirco Michel
Arne Elofsson
author_sort Marcin J Skwark
title Improved contact predictions using the recognition of protein like contact patterns.
title_short Improved contact predictions using the recognition of protein like contact patterns.
title_full Improved contact predictions using the recognition of protein like contact patterns.
title_fullStr Improved contact predictions using the recognition of protein like contact patterns.
title_full_unstemmed Improved contact predictions using the recognition of protein like contact patterns.
title_sort improved contact predictions using the recognition of protein like contact patterns.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2014-11-01
description Given sufficient large protein families, and using a global statistical inference approach, it is possible to obtain sufficient accuracy in protein residue contact predictions to predict the structure of many proteins. However, these approaches do not consider the fact that the contacts in a protein are neither randomly, nor independently distributed, but actually follow precise rules governed by the structure of the protein and thus are interdependent. Here, we present PconsC2, a novel method that uses a deep learning approach to identify protein-like contact patterns to improve contact predictions. A substantial enhancement can be seen for all contacts independently on the number of aligned sequences, residue separation or secondary structure type, but is largest for β-sheet containing proteins. In addition to being superior to earlier methods based on statistical inferences, in comparison to state of the art methods using machine learning, PconsC2 is superior for families with more than 100 effective sequence homologs. The improved contact prediction enables improved structure prediction.
url http://europepmc.org/articles/PMC4222596?pdf=render
work_keys_str_mv AT marcinjskwark improvedcontactpredictionsusingtherecognitionofproteinlikecontactpatterns
AT danieleraimondi improvedcontactpredictionsusingtherecognitionofproteinlikecontactpatterns
AT mircomichel improvedcontactpredictionsusingtherecognitionofproteinlikecontactpatterns
AT arneelofsson improvedcontactpredictionsusingtherecognitionofproteinlikecontactpatterns
_version_ 1725880701053566976