Application of biclustering algorithm to extract rules from labeled data

Purpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very u...

Full description

Bibliographic Details
Main Authors: Zhang Yanjie, Sun Hongbo
Format: Article
Language:English
Published: Emerald Publishing 2018-11-01
Series:International Journal of Crowd Science
Subjects:
Online Access:https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002
id doaj-df8a7e73f8bd4bed9f3f5b9552ef81af
record_format Article
spelling doaj-df8a7e73f8bd4bed9f3f5b9552ef81af2020-11-25T01:33:43ZengEmerald PublishingInternational Journal of Crowd Science2398-72942018-11-0122869810.1108/IJCS-01-2018-0002610919Application of biclustering algorithm to extract rules from labeled dataZhang Yanjie0Sun Hongbo1School of Computer Science and Control Engineering, Yantai University, Yantai, ChinaSchool of Computer and Control Engineering, Yantai University, Yantai, ChinaPurpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very useful in mining the important and heuristic knowledge hidden in the original high-dimensional data. It can help us to construct predictive models with few attributes of the data so as to provide valuable model interpretability and less training times. Design/methodology/approach - In this paper, a novel rule extraction method with the application of biclustering algorithm is proposed. Findings - To choose the most significant biclusters from the huge number of detected biclusters, a specially modified information entropy calculation method is also provided. It will be shown that all of the important knowledge is in practice hidden in these biclusters. Originality/value - The novelty of the new method lies in the detected biclusters can be conveniently translated into if-then rules. It provides an intuitively explainable and comprehensive approach to extract rules from high-dimensional data while keeping high classification accuracy.https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002Biclustering algorithmCrowdsourced big data and analyticsRule extraction
collection DOAJ
language English
format Article
sources DOAJ
author Zhang Yanjie
Sun Hongbo
spellingShingle Zhang Yanjie
Sun Hongbo
Application of biclustering algorithm to extract rules from labeled data
International Journal of Crowd Science
Biclustering algorithm
Crowdsourced big data and analytics
Rule extraction
author_facet Zhang Yanjie
Sun Hongbo
author_sort Zhang Yanjie
title Application of biclustering algorithm to extract rules from labeled data
title_short Application of biclustering algorithm to extract rules from labeled data
title_full Application of biclustering algorithm to extract rules from labeled data
title_fullStr Application of biclustering algorithm to extract rules from labeled data
title_full_unstemmed Application of biclustering algorithm to extract rules from labeled data
title_sort application of biclustering algorithm to extract rules from labeled data
publisher Emerald Publishing
series International Journal of Crowd Science
issn 2398-7294
publishDate 2018-11-01
description Purpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very useful in mining the important and heuristic knowledge hidden in the original high-dimensional data. It can help us to construct predictive models with few attributes of the data so as to provide valuable model interpretability and less training times. Design/methodology/approach - In this paper, a novel rule extraction method with the application of biclustering algorithm is proposed. Findings - To choose the most significant biclusters from the huge number of detected biclusters, a specially modified information entropy calculation method is also provided. It will be shown that all of the important knowledge is in practice hidden in these biclusters. Originality/value - The novelty of the new method lies in the detected biclusters can be conveniently translated into if-then rules. It provides an intuitively explainable and comprehensive approach to extract rules from high-dimensional data while keeping high classification accuracy.
topic Biclustering algorithm
Crowdsourced big data and analytics
Rule extraction
url https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002
work_keys_str_mv AT zhangyanjie applicationofbiclusteringalgorithmtoextractrulesfromlabeleddata
AT sunhongbo applicationofbiclusteringalgorithmtoextractrulesfromlabeleddata
_version_ 1725076220944580608