Summary: | Essential proteins are important for the survival and reproduction of organisms. Owing to the availability of vast amounts of protein-protein interaction (PPI) data, many computational methods have been proposed to identify essential proteins. PPI networks have been shown to exhibit graph-theoretic characteristics such as the small-world and scale-free properties. Traditional metrics do not adequately reflect the relationships between proteins when identifying essential proteins from PPI networks. In this paper, we construct a diffusion distance network (DSN) by combining PPI topology characteristics with orthologous protein information and the sub-cellular localization of proteins. Taking the modularity of essential proteins into account, we propose a new essential protein prediction method based on the DSN. We apply our DSN method and ten other state-of-the-art methods to predict essential proteins. Precision-recall curves, the jackknife methodology, and other measures are used to evaluate the performance of these methods. Experimental results show that our method outperforms the ten other competing methods. The raw data and the software are freely available at: https://github.com/husaiccsu/DSN.
|