|
|
|
|
LEADER |
02367nam a2200421Ia 4500 |
001 |
10.1186-s12859-021-04252-y |
008 |
220427s2021 CNT 000 0 und d |
020 |
|
|
|a 14712105 (ISSN)
|
245 |
1 |
0 |
|a Predicting biological pathways of chemical compounds with a profile-inspired approach
|
260 |
|
0 |
|b BioMed Central Ltd
|c 2021
|
856 |
|
|
|z View Fulltext in Publisher
|u https://doi.org/10.1186/s12859-021-04252-y
|
520 |
3 |
|
|a Background: Assignment of chemical compounds to biological pathways is a crucial step to understand the relationship between the chemical repertory of an organism and its biology. Protein sequence profiles are very successful in capturing the main structural and functional features of a protein family, and can be used to assign new members to it based on matching of their sequences against these profiles. In this work, we extend this idea to chemical compounds, constructing a profile-inspired model for a set of related metabolites (those in the same biological pathway), based on a fragment-based vectorial representation of their chemical structures. Results: We use this representation to predict the biological pathway of a chemical compound with good overall accuracy (AUC 0.74–0.90 depending on the database tested), and analyzed some factors that affect performance. The approach, which is compared with equivalent methods, can in addition detect those molecular fragments characteristic of a pathway. Conclusions: The method is available as a graphical interactive web server http://csbg.cnb.csic.es/iFragMent. © 2021, The Author(s).
|
650 |
0 |
4 |
|a amino acid sequence
|
650 |
0 |
4 |
|a Amino Acid Sequence
|
650 |
0 |
4 |
|a article
|
650 |
0 |
4 |
|a Biological pathways
|
650 |
0 |
4 |
|a Chemical analysis
|
650 |
0 |
4 |
|a Databases, Factual
|
650 |
0 |
4 |
|a factual database
|
650 |
0 |
4 |
|a Functional features
|
650 |
0 |
4 |
|a Internet
|
650 |
0 |
4 |
|a Internet
|
650 |
0 |
4 |
|a Metabolites
|
650 |
0 |
4 |
|a Molecular fragments
|
650 |
0 |
4 |
|a New members
|
650 |
0 |
4 |
|a Overall accuracies
|
650 |
0 |
4 |
|a protein
|
650 |
0 |
4 |
|a Protein family
|
650 |
0 |
4 |
|a Protein sequence profiles
|
650 |
0 |
4 |
|a Proteins
|
650 |
0 |
4 |
|a Proteins
|
650 |
0 |
4 |
|a software
|
650 |
0 |
4 |
|a Software
|
650 |
0 |
4 |
|a Web servers
|
700 |
1 |
|
|a Chagoyen, M.
|e author
|
700 |
1 |
|
|a Lopez-Ibañez, J.
|e author
|
700 |
1 |
|
|a Pazos, F.
|e author
|
773 |
|
|
|t BMC Bioinformatics
|