Similarity/Dissimilarity Analysis of Protein Sequences Based on a New Spectrum-Like Graphical Representation

Bibliographic Details
Main Authors: Yuhua Yao, Shoujiang Yan, Huimin Xu, Jianning Han, Xuying Nan, Ping-an He, Qi Dai
Format: Article
Language: English
Published: SAGE Publishing 2014-01-01
Series: Evolutionary Bioinformatics
Online Access: https://doi.org/10.4137/EBO.S14713
Description
Summary: Sequence comparison is one of the foundations of bioinformatics and can be used to study evolutionary relationships among sequences. In this study, a 2D spectrum-like graphical representation of protein sequences is presented, based on the hydrophobicity scale of amino acids. The frequencies of the amplitudes of 4-subsequences are used to characterize the spectrum-like graph, and a 17D vector serves as the descriptor of a protein sequence. The χ² value of a compatibility test is then computed. The new similarity analysis approach is illustrated on all protein sequences encoded by the mitochondrial genomes of 20 different species. Finally, a comparison with the ClustalW method shows the utility of our method.
ISSN: 1176-9343