Application of biclustering algorithm to extract rules from labeled data

Purpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very u...

Full description

Bibliographic Details
Main Authors:	Zhang Yanjie, Sun Hongbo
Format:	Article
Language:	English
Published:	Emerald Publishing 2018-11-01
Series:	International Journal of Crowd Science
Subjects:	Biclustering algorithm Crowdsourced big data and analytics Rule extraction
Online Access:	https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002

id	doaj-df8a7e73f8bd4bed9f3f5b9552ef81af
record_format	Article
spelling	doaj-df8a7e73f8bd4bed9f3f5b9552ef81af2020-11-25T01:33:43ZengEmerald PublishingInternational Journal of Crowd Science2398-72942018-11-0122869810.1108/IJCS-01-2018-0002610919Application of biclustering algorithm to extract rules from labeled dataZhang Yanjie0Sun Hongbo1School of Computer Science and Control Engineering, Yantai University, Yantai, ChinaSchool of Computer and Control Engineering, Yantai University, Yantai, ChinaPurpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very useful in mining the important and heuristic knowledge hidden in the original high-dimensional data. It can help us to construct predictive models with few attributes of the data so as to provide valuable model interpretability and less training times. Design/methodology/approach - In this paper, a novel rule extraction method with the application of biclustering algorithm is proposed. Findings - To choose the most significant biclusters from the huge number of detected biclusters, a specially modified information entropy calculation method is also provided. It will be shown that all of the important knowledge is in practice hidden in these biclusters. Originality/value - The novelty of the new method lies in the detected biclusters can be conveniently translated into if-then rules. It provides an intuitively explainable and comprehensive approach to extract rules from high-dimensional data while keeping high classification accuracy.https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002Biclustering algorithmCrowdsourced big data and analyticsRule extraction
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Zhang Yanjie Sun Hongbo
spellingShingle	Zhang Yanjie Sun Hongbo Application of biclustering algorithm to extract rules from labeled data International Journal of Crowd Science Biclustering algorithm Crowdsourced big data and analytics Rule extraction
author_facet	Zhang Yanjie Sun Hongbo
author_sort	Zhang Yanjie
title	Application of biclustering algorithm to extract rules from labeled data
title_short	Application of biclustering algorithm to extract rules from labeled data
title_full	Application of biclustering algorithm to extract rules from labeled data
title_fullStr	Application of biclustering algorithm to extract rules from labeled data
title_full_unstemmed	Application of biclustering algorithm to extract rules from labeled data
title_sort	application of biclustering algorithm to extract rules from labeled data
publisher	Emerald Publishing
series	International Journal of Crowd Science
issn	2398-7294
publishDate	2018-11-01
description	Purpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very useful in mining the important and heuristic knowledge hidden in the original high-dimensional data. It can help us to construct predictive models with few attributes of the data so as to provide valuable model interpretability and less training times. Design/methodology/approach - In this paper, a novel rule extraction method with the application of biclustering algorithm is proposed. Findings - To choose the most significant biclusters from the huge number of detected biclusters, a specially modified information entropy calculation method is also provided. It will be shown that all of the important knowledge is in practice hidden in these biclusters. Originality/value - The novelty of the new method lies in the detected biclusters can be conveniently translated into if-then rules. It provides an intuitively explainable and comprehensive approach to extract rules from high-dimensional data while keeping high classification accuracy.
topic	Biclustering algorithm Crowdsourced big data and analytics Rule extraction
url	https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002
work_keys_str_mv	AT zhangyanjie applicationofbiclusteringalgorithmtoextractrulesfromlabeleddata AT sunhongbo applicationofbiclusteringalgorithmtoextractrulesfromlabeleddata
_version_	1725076220944580608

Application of biclustering algorithm to extract rules from labeled data

Similar Items