MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy

Abstract Background Protein subcellular localization plays a crucial role in understanding cell function. Proteins need to be in the right place at the right time, and combine with the corresponding molecules to fulfill their functions. Furthermore, prediction of protein subcellular location not onl...

Full description

Bibliographic Details
Main Authors: Fan Yang, Yang Liu, Yanbin Wang, Zhijian Yin, Zhen Yang
Format: Article
Language:English
Published: BMC 2019-10-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-019-3136-3
id doaj-6f8c00de154c479986009b6cdc864218
record_format Article
spelling doaj-6f8c00de154c479986009b6cdc8642182020-11-25T03:43:27ZengBMCBMC Bioinformatics1471-21052019-10-0120112110.1186/s12859-019-3136-3MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategyFan Yang0Yang Liu1Yanbin Wang2Zhijian Yin3Zhen Yang4School of Communications and Electronics, Jiangxi Science & Technology Normal UniversitySchool of Communications and Electronics, Jiangxi Science & Technology Normal UniversitySchool of Communications and Electronics, Jiangxi Science & Technology Normal UniversitySchool of Communications and Electronics, Jiangxi Science & Technology Normal UniversitySchool of Communications and Electronics, Jiangxi Science & Technology Normal UniversityAbstract Background Protein subcellular localization plays a crucial role in understanding cell function. Proteins need to be in the right place at the right time, and combine with the corresponding molecules to fulfill their functions. Furthermore, prediction of protein subcellular location not only should be a guiding role in drug design and development due to potential molecular targets but also be an essential role in genome annotation. Taking the current status of image-based protein subcellular localization as an example, there are three common drawbacks, i.e., obsolete datasets without updating label information, stereotypical feature descriptor on spatial domain or grey level, and single-function prediction algorithm’s limited capacity of handling single-label database. Results In this paper, a novel human protein subcellular localization prediction model MIC_Locator is proposed. Firstly, the latest datasets are collected and collated as our benchmark dataset instead of obsolete data while training prediction model. Secondly, Fourier transformation, Riesz transformation, Log-Gabor filter and intensity coding strategy are employed to obtain frequency feature based on three components of monogenic signal with different frequency scales. Thirdly, a chained prediction model is proposed to handle multi-label instead of single-label datasets. The experiment results showed that the MIC_Locator can achieve 60.56% subset accuracy and outperform the existing majority of prediction models, and the frequency feature and intensity coding strategy can be conducive to improving the classification accuracy. Conclusions Our results demonstrate that the frequency feature is more beneficial for improving the performance of model compared to features extracted from spatial domain, and the MIC_Locator proposed in this paper can speed up validation of protein annotation, knowledge of protein function and proteomics research.http://link.springer.com/article/10.1186/s12859-019-3136-3Bioimage informaticsProtein subcellular localizationFrequency domain featureMonogenic signalImage intensity encoding strategyMulti-label classifier chain
collection DOAJ
language English
format Article
sources DOAJ
author Fan Yang
Yang Liu
Yanbin Wang
Zhijian Yin
Zhen Yang
spellingShingle Fan Yang
Yang Liu
Yanbin Wang
Zhijian Yin
Zhen Yang
MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
BMC Bioinformatics
Bioimage informatics
Protein subcellular localization
Frequency domain feature
Monogenic signal
Image intensity encoding strategy
Multi-label classifier chain
author_facet Fan Yang
Yang Liu
Yanbin Wang
Zhijian Yin
Zhen Yang
author_sort Fan Yang
title MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_short MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_full MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_fullStr MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_full_unstemmed MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_sort mic_locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2019-10-01
description Abstract Background Protein subcellular localization plays a crucial role in understanding cell function. Proteins need to be in the right place at the right time, and combine with the corresponding molecules to fulfill their functions. Furthermore, prediction of protein subcellular location not only should be a guiding role in drug design and development due to potential molecular targets but also be an essential role in genome annotation. Taking the current status of image-based protein subcellular localization as an example, there are three common drawbacks, i.e., obsolete datasets without updating label information, stereotypical feature descriptor on spatial domain or grey level, and single-function prediction algorithm’s limited capacity of handling single-label database. Results In this paper, a novel human protein subcellular localization prediction model MIC_Locator is proposed. Firstly, the latest datasets are collected and collated as our benchmark dataset instead of obsolete data while training prediction model. Secondly, Fourier transformation, Riesz transformation, Log-Gabor filter and intensity coding strategy are employed to obtain frequency feature based on three components of monogenic signal with different frequency scales. Thirdly, a chained prediction model is proposed to handle multi-label instead of single-label datasets. The experiment results showed that the MIC_Locator can achieve 60.56% subset accuracy and outperform the existing majority of prediction models, and the frequency feature and intensity coding strategy can be conducive to improving the classification accuracy. Conclusions Our results demonstrate that the frequency feature is more beneficial for improving the performance of model compared to features extracted from spatial domain, and the MIC_Locator proposed in this paper can speed up validation of protein annotation, knowledge of protein function and proteomics research.
topic Bioimage informatics
Protein subcellular localization
Frequency domain feature
Monogenic signal
Image intensity encoding strategy
Multi-label classifier chain
url http://link.springer.com/article/10.1186/s12859-019-3136-3
work_keys_str_mv AT fanyang miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
AT yangliu miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
AT yanbinwang miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
AT zhijianyin miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
AT zhenyang miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
_version_ 1724519773269655552