Alignment-free clustering of large data sets of unannotated protein conserved regions using minhashing
Abstract Background Clustering of protein sequences is of key importance in predicting the structure and function of newly sequenced proteins and is also of use for their annotation. With the advent of multiple high-throughput sequencing technologies, new protein sequences are becoming available at...
Main Authors: | Armen Abnousi, Shira L. Broschat, Ananth Kalyanaraman |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-03-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-018-2080-y |
Similar Items
-
A Fast Alignment-Free Approach for De Novo Detection of Protein Conserved Regions.
by: Armen Abnousi, et al.
Published: (2016-01-01) -
Whole Proteome Clustering of 2,307 Proteobacterial Genomes Reveals Conserved Proteins and Significant Annotation Issues
by: Svetlana Lockwood, et al.
Published: (2019-02-01) -
A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks
by: Le Ou-Yang, et al.
Published: (2017-12-01) -
PASS: Protein Annotation Surveillance Site for Protein Annotation Using Homologous Clusters, NLP, and Sequence Similarity Networks
by: Jin Tao, et al.
Published: (2021-09-01) -
Identification of Unannotated Small Genes in Salmonella
by: Jonghwan Baek, et al.
Published: (2017-03-01)