Application of biclustering algorithm to extract rules from labeled data
Purpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very u...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Emerald Publishing
2018-11-01
|
Series: | International Journal of Crowd Science |
Subjects: | |
Online Access: | https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002 |
id |
doaj-df8a7e73f8bd4bed9f3f5b9552ef81af |
---|---|
record_format |
Article |
spelling |
doaj-df8a7e73f8bd4bed9f3f5b9552ef81af2020-11-25T01:33:43ZengEmerald PublishingInternational Journal of Crowd Science2398-72942018-11-0122869810.1108/IJCS-01-2018-0002610919Application of biclustering algorithm to extract rules from labeled dataZhang Yanjie0Sun Hongbo1School of Computer Science and Control Engineering, Yantai University, Yantai, ChinaSchool of Computer and Control Engineering, Yantai University, Yantai, ChinaPurpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very useful in mining the important and heuristic knowledge hidden in the original high-dimensional data. It can help us to construct predictive models with few attributes of the data so as to provide valuable model interpretability and less training times. Design/methodology/approach - In this paper, a novel rule extraction method with the application of biclustering algorithm is proposed. Findings - To choose the most significant biclusters from the huge number of detected biclusters, a specially modified information entropy calculation method is also provided. It will be shown that all of the important knowledge is in practice hidden in these biclusters. Originality/value - The novelty of the new method lies in the detected biclusters can be conveniently translated into if-then rules. It provides an intuitively explainable and comprehensive approach to extract rules from high-dimensional data while keeping high classification accuracy.https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002Biclustering algorithmCrowdsourced big data and analyticsRule extraction |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhang Yanjie Sun Hongbo |
spellingShingle |
Zhang Yanjie Sun Hongbo Application of biclustering algorithm to extract rules from labeled data International Journal of Crowd Science Biclustering algorithm Crowdsourced big data and analytics Rule extraction |
author_facet |
Zhang Yanjie Sun Hongbo |
author_sort |
Zhang Yanjie |
title |
Application of biclustering algorithm to extract rules from labeled data |
title_short |
Application of biclustering algorithm to extract rules from labeled data |
title_full |
Application of biclustering algorithm to extract rules from labeled data |
title_fullStr |
Application of biclustering algorithm to extract rules from labeled data |
title_full_unstemmed |
Application of biclustering algorithm to extract rules from labeled data |
title_sort |
application of biclustering algorithm to extract rules from labeled data |
publisher |
Emerald Publishing |
series |
International Journal of Crowd Science |
issn |
2398-7294 |
publishDate |
2018-11-01 |
description |
Purpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very useful in mining the important and heuristic knowledge hidden in the original high-dimensional data. It can help us to construct predictive models with few attributes of the data so as to provide valuable model interpretability and less training times. Design/methodology/approach - In this paper, a novel rule extraction method with the application of biclustering algorithm is proposed. Findings - To choose the most significant biclusters from the huge number of detected biclusters, a specially modified information entropy calculation method is also provided. It will be shown that all of the important knowledge is in practice hidden in these biclusters. Originality/value - The novelty of the new method lies in the detected biclusters can be conveniently translated into if-then rules. It provides an intuitively explainable and comprehensive approach to extract rules from high-dimensional data while keeping high classification accuracy. |
topic |
Biclustering algorithm Crowdsourced big data and analytics Rule extraction |
url |
https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002 |
work_keys_str_mv |
AT zhangyanjie applicationofbiclusteringalgorithmtoextractrulesfromlabeleddata AT sunhongbo applicationofbiclusteringalgorithmtoextractrulesfromlabeleddata |
_version_ |
1725076220944580608 |