Identifying protein complexes based on an edge weight algorithm and core-attachment structure

Abstract Background Protein complex identification from protein-protein interaction (PPI) networks is crucial for understanding cellular organization principles and functional mechanisms. In recent decades, numerous computational methods have been proposed to identify protein complexes. However, mos...

Full description

Bibliographic Details
Main Authors: Rongquan Wang, Guixia Liu, Caixia Wang
Format: Article
Language:English
Published: BMC 2019-09-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-019-3007-y
id doaj-de4857acc16d4e929c066a2ce75f2590
record_format Article
spelling doaj-de4857acc16d4e929c066a2ce75f25902020-11-25T03:41:10ZengBMCBMC Bioinformatics1471-21052019-09-0120112010.1186/s12859-019-3007-yIdentifying protein complexes based on an edge weight algorithm and core-attachment structureRongquan Wang0Guixia Liu1Caixia Wang2College of Computer Science and Technology, Jilin UniversityCollege of Computer Science and Technology, Jilin UniversitySchool of International Economics, China Foreign Affairs UniversityAbstract Background Protein complex identification from protein-protein interaction (PPI) networks is crucial for understanding cellular organization principles and functional mechanisms. In recent decades, numerous computational methods have been proposed to identify protein complexes. However, most of the current state-of-the-art studies still have some challenges to resolve, including their high false-positives rates, incapability of identifying overlapping complexes, lack of consideration for the inherent organization within protein complexes, and absence of some biological attachment proteins. Results In this paper, to overcome these limitations, we present a protein complex identification method based on an edge weight method and core-attachment structure (EWCA) which consists of a complex core and some sparse attachment proteins. First, we propose a new weighting method to assess the reliability of interactions. Second, we identify protein complex cores by using the structural similarity between a seed and its direct neighbors. Third, we introduce a new method to detect attachment proteins that is able to distinguish and identify peripheral proteins and overlapping proteins. Finally, we bind attachment proteins to their corresponding complex cores to form protein complexes and discard redundant protein complexes. The experimental results indicate that EWCA outperforms existing state-of-the-art methods in terms of both accuracy and p-value. Furthermore, EWCA could identify many more protein complexes with statistical significance. Additionally, EWCA could have better balance accuracy and efficiency than some state-of-the-art methods with high accuracy. Conclusions In summary, EWCA has better performance for protein complex identification by a comprehensive comparison with twelve algorithms in terms of different evaluation metrics. The datasets and software are freely available for academic research at https://github.com/RongquanWang/EWCA.http://link.springer.com/article/10.1186/s12859-019-3007-yProtein complexesProtein-protein interaction networksCore-attachment structureSpurious interactionsStructural similarity
collection DOAJ
language English
format Article
sources DOAJ
author Rongquan Wang
Guixia Liu
Caixia Wang
spellingShingle Rongquan Wang
Guixia Liu
Caixia Wang
Identifying protein complexes based on an edge weight algorithm and core-attachment structure
BMC Bioinformatics
Protein complexes
Protein-protein interaction networks
Core-attachment structure
Spurious interactions
Structural similarity
author_facet Rongquan Wang
Guixia Liu
Caixia Wang
author_sort Rongquan Wang
title Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_short Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_full Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_fullStr Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_full_unstemmed Identifying protein complexes based on an edge weight algorithm and core-attachment structure
title_sort identifying protein complexes based on an edge weight algorithm and core-attachment structure
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2019-09-01
description Abstract Background Protein complex identification from protein-protein interaction (PPI) networks is crucial for understanding cellular organization principles and functional mechanisms. In recent decades, numerous computational methods have been proposed to identify protein complexes. However, most of the current state-of-the-art studies still have some challenges to resolve, including their high false-positives rates, incapability of identifying overlapping complexes, lack of consideration for the inherent organization within protein complexes, and absence of some biological attachment proteins. Results In this paper, to overcome these limitations, we present a protein complex identification method based on an edge weight method and core-attachment structure (EWCA) which consists of a complex core and some sparse attachment proteins. First, we propose a new weighting method to assess the reliability of interactions. Second, we identify protein complex cores by using the structural similarity between a seed and its direct neighbors. Third, we introduce a new method to detect attachment proteins that is able to distinguish and identify peripheral proteins and overlapping proteins. Finally, we bind attachment proteins to their corresponding complex cores to form protein complexes and discard redundant protein complexes. The experimental results indicate that EWCA outperforms existing state-of-the-art methods in terms of both accuracy and p-value. Furthermore, EWCA could identify many more protein complexes with statistical significance. Additionally, EWCA could have better balance accuracy and efficiency than some state-of-the-art methods with high accuracy. Conclusions In summary, EWCA has better performance for protein complex identification by a comprehensive comparison with twelve algorithms in terms of different evaluation metrics. The datasets and software are freely available for academic research at https://github.com/RongquanWang/EWCA.
topic Protein complexes
Protein-protein interaction networks
Core-attachment structure
Spurious interactions
Structural similarity
url http://link.springer.com/article/10.1186/s12859-019-3007-y
work_keys_str_mv AT rongquanwang identifyingproteincomplexesbasedonanedgeweightalgorithmandcoreattachmentstructure
AT guixialiu identifyingproteincomplexesbasedonanedgeweightalgorithmandcoreattachmentstructure
AT caixiawang identifyingproteincomplexesbasedonanedgeweightalgorithmandcoreattachmentstructure
_version_ 1724531193199722496