Predicting biological pathways of chemical compounds with a profile-inspired approach

Background: Assignment of chemical compounds to biological pathways is a crucial step to understand the relationship between the chemical repertory of an organism and its biology. Protein sequence profiles are very successful in capturing the main structural and functional features of a protein fami...

Full description

Bibliographic Details
Main Authors: Chagoyen, M. (Author), Lopez-Ibañez, J. (Author), Pazos, F. (Author)
Format: Article
Language:English
Published: BioMed Central Ltd 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02367nam a2200421Ia 4500
001 10.1186-s12859-021-04252-y
008 220427s2021 CNT 000 0 und d
020 |a 14712105 (ISSN) 
245 1 0 |a Predicting biological pathways of chemical compounds with a profile-inspired approach 
260 0 |b BioMed Central Ltd  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1186/s12859-021-04252-y 
520 3 |a Background: Assignment of chemical compounds to biological pathways is a crucial step to understand the relationship between the chemical repertory of an organism and its biology. Protein sequence profiles are very successful in capturing the main structural and functional features of a protein family, and can be used to assign new members to it based on matching of their sequences against these profiles. In this work, we extend this idea to chemical compounds, constructing a profile-inspired model for a set of related metabolites (those in the same biological pathway), based on a fragment-based vectorial representation of their chemical structures. Results: We use this representation to predict the biological pathway of a chemical compound with good overall accuracy (AUC 0.74–0.90 depending on the database tested), and analyzed some factors that affect performance. The approach, which is compared with equivalent methods, can in addition detect those molecular fragments characteristic of a pathway. Conclusions: The method is available as a graphical interactive web server http://csbg.cnb.csic.es/iFragMent. © 2021, The Author(s). 
650 0 4 |a amino acid sequence 
650 0 4 |a Amino Acid Sequence 
650 0 4 |a article 
650 0 4 |a Biological pathways 
650 0 4 |a Chemical analysis 
650 0 4 |a Databases, Factual 
650 0 4 |a factual database 
650 0 4 |a Functional features 
650 0 4 |a Internet 
650 0 4 |a Internet 
650 0 4 |a Metabolites 
650 0 4 |a Molecular fragments 
650 0 4 |a New members 
650 0 4 |a Overall accuracies 
650 0 4 |a protein 
650 0 4 |a Protein family 
650 0 4 |a Protein sequence profiles 
650 0 4 |a Proteins 
650 0 4 |a Proteins 
650 0 4 |a software 
650 0 4 |a Software 
650 0 4 |a Web servers 
700 1 |a Chagoyen, M.  |e author 
700 1 |a Lopez-Ibañez, J.  |e author 
700 1 |a Pazos, F.  |e author 
773 |t BMC Bioinformatics