A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules

Data-driven knowledge acquisition is one of the key research fields in data mining. Dealing with large amounts of data has received a lot of attention in the field recently, and a number of methodologies have been proposed to extract insights from data in an automated or semi-automated manner. Howev...

Full description

Bibliographic Details
Main Authors: Maqbool Ali, Rahman Ali, Wajahat Ali Khan, Soyeon Caren Han, Jaehun Bang, Taeho Hur, Dohyeong Kim, Sungyoung Lee, Byeong Ho Kang
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8319403/
id doaj-d31435bf49df48f6a98f0668b07958c6
record_format Article
spelling doaj-d31435bf49df48f6a98f0668b07958c62021-03-29T20:48:17ZengIEEEIEEE Access2169-35362018-01-016155871560710.1109/ACCESS.2018.28170228319403A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production RulesMaqbool Ali0https://orcid.org/0000-0002-4107-7122Rahman Ali1Wajahat Ali Khan2Soyeon Caren Han3Jaehun Bang4Taeho Hur5Dohyeong Kim6Sungyoung Lee7Byeong Ho Kang8Department of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaQuaid-e-Azam College of Commerce, University of Peshawar, Peshawar, PakistanDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaSchool of Information Technologies, The University of Sydney, Sydney, NSW, AustraliaDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaSchool of Engineering and ICT, University of Tasmania, Hobart, TAS, AustraliaData-driven knowledge acquisition is one of the key research fields in data mining. Dealing with large amounts of data has received a lot of attention in the field recently, and a number of methodologies have been proposed to extract insights from data in an automated or semi-automated manner. However, these methodologies generally target a specific aspect of the data mining process, such as data acquisition, data preprocessing, or data classification. However, a comprehensive knowledge acquisition method is crucial to support the end-to-end knowledge engineering process. In this paper, we introduce a knowledge acquisition system that covers all major phases of the cross-industry standard process for data mining. Acknowledging the importance of an end-to-end knowledge engineering process, we designed and developed an easy-to-use data-driven knowledge acquisition tool (DDKAT). The major features of the DDKAT are: (1) a novel unified features scoring approach for data selection; (2) a user-friendly data processing interface to improve the quality of the raw data; (3) an appropriate decision tree algorithm selection approach to build a classification model; and (4) the generation of production rules from various decision tree classification models in an automated manner. Furthermore, two diabetes studies were performed to assess the value of the DDKAT in terms of user experience. A total of 19 experts were involved in the first study and 102 students in the artificial intelligence domain were involved in the second study. The results showed that the overall user experience of the DDKAT was positive in terms of its attractiveness, as well as its pragmatic and hedonic quality factors.https://ieeexplore.ieee.org/document/8319403/Knowledge engineeringdata miningfeatures rankingalgorithm selectiondecision treeproduction rule
collection DOAJ
language English
format Article
sources DOAJ
author Maqbool Ali
Rahman Ali
Wajahat Ali Khan
Soyeon Caren Han
Jaehun Bang
Taeho Hur
Dohyeong Kim
Sungyoung Lee
Byeong Ho Kang
spellingShingle Maqbool Ali
Rahman Ali
Wajahat Ali Khan
Soyeon Caren Han
Jaehun Bang
Taeho Hur
Dohyeong Kim
Sungyoung Lee
Byeong Ho Kang
A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules
IEEE Access
Knowledge engineering
data mining
features ranking
algorithm selection
decision tree
production rule
author_facet Maqbool Ali
Rahman Ali
Wajahat Ali Khan
Soyeon Caren Han
Jaehun Bang
Taeho Hur
Dohyeong Kim
Sungyoung Lee
Byeong Ho Kang
author_sort Maqbool Ali
title A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules
title_short A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules
title_full A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules
title_fullStr A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules
title_full_unstemmed A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules
title_sort data-driven knowledge acquisition system: an end-to-end knowledge engineering process for generating production rules
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description Data-driven knowledge acquisition is one of the key research fields in data mining. Dealing with large amounts of data has received a lot of attention in the field recently, and a number of methodologies have been proposed to extract insights from data in an automated or semi-automated manner. However, these methodologies generally target a specific aspect of the data mining process, such as data acquisition, data preprocessing, or data classification. However, a comprehensive knowledge acquisition method is crucial to support the end-to-end knowledge engineering process. In this paper, we introduce a knowledge acquisition system that covers all major phases of the cross-industry standard process for data mining. Acknowledging the importance of an end-to-end knowledge engineering process, we designed and developed an easy-to-use data-driven knowledge acquisition tool (DDKAT). The major features of the DDKAT are: (1) a novel unified features scoring approach for data selection; (2) a user-friendly data processing interface to improve the quality of the raw data; (3) an appropriate decision tree algorithm selection approach to build a classification model; and (4) the generation of production rules from various decision tree classification models in an automated manner. Furthermore, two diabetes studies were performed to assess the value of the DDKAT in terms of user experience. A total of 19 experts were involved in the first study and 102 students in the artificial intelligence domain were involved in the second study. The results showed that the overall user experience of the DDKAT was positive in terms of its attractiveness, as well as its pragmatic and hedonic quality factors.
topic Knowledge engineering
data mining
features ranking
algorithm selection
decision tree
production rule
url https://ieeexplore.ieee.org/document/8319403/
work_keys_str_mv AT maqboolali adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT rahmanali adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT wajahatalikhan adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT soyeoncarenhan adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT jaehunbang adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT taehohur adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT dohyeongkim adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT sungyounglee adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT byeonghokang adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT maqboolali datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT rahmanali datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT wajahatalikhan datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT soyeoncarenhan datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT jaehunbang datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT taehohur datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT dohyeongkim datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT sungyounglee datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
AT byeonghokang datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules
_version_ 1724194192243032064