Application of biclustering algorithm to extract rules from labeled data

Purpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very u...

Full description

Bibliographic Details
Main Authors: Zhang Yanjie, Sun Hongbo
Format: Article
Language:English
Published: Emerald Publishing 2018-11-01
Series:International Journal of Crowd Science
Subjects:
Online Access:https://www.emeraldinsight.com/doi/pdfplus/10.1108/IJCS-01-2018-0002
Description
Summary:Purpose - For many pattern recognition problems, the relation between the sample vectors and the class labels are known during the data acquisition procedure. However, how to find the useful rules or knowledge hidden in the data is very important and challengeable. Rule extraction methods are very useful in mining the important and heuristic knowledge hidden in the original high-dimensional data. It can help us to construct predictive models with few attributes of the data so as to provide valuable model interpretability and less training times. Design/methodology/approach - In this paper, a novel rule extraction method with the application of biclustering algorithm is proposed. Findings - To choose the most significant biclusters from the huge number of detected biclusters, a specially modified information entropy calculation method is also provided. It will be shown that all of the important knowledge is in practice hidden in these biclusters. Originality/value - The novelty of the new method lies in the detected biclusters can be conveniently translated into if-then rules. It provides an intuitively explainable and comprehensive approach to extract rules from high-dimensional data while keeping high classification accuracy.
ISSN:2398-7294