A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules
Data-driven knowledge acquisition is one of the key research fields in data mining. Dealing with large amounts of data has received a lot of attention in the field recently, and a number of methodologies have been proposed to extract insights from data in an automated or semi-automated manner. Howev...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8319403/ |
id |
doaj-d31435bf49df48f6a98f0668b07958c6 |
---|---|
record_format |
Article |
spelling |
doaj-d31435bf49df48f6a98f0668b07958c62021-03-29T20:48:17ZengIEEEIEEE Access2169-35362018-01-016155871560710.1109/ACCESS.2018.28170228319403A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production RulesMaqbool Ali0https://orcid.org/0000-0002-4107-7122Rahman Ali1Wajahat Ali Khan2Soyeon Caren Han3Jaehun Bang4Taeho Hur5Dohyeong Kim6Sungyoung Lee7Byeong Ho Kang8Department of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaQuaid-e-Azam College of Commerce, University of Peshawar, Peshawar, PakistanDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaSchool of Information Technologies, The University of Sydney, Sydney, NSW, AustraliaDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaDepartment of Computer Science and Engineering, Kyung Hee University, Yongin, South KoreaSchool of Engineering and ICT, University of Tasmania, Hobart, TAS, AustraliaData-driven knowledge acquisition is one of the key research fields in data mining. Dealing with large amounts of data has received a lot of attention in the field recently, and a number of methodologies have been proposed to extract insights from data in an automated or semi-automated manner. However, these methodologies generally target a specific aspect of the data mining process, such as data acquisition, data preprocessing, or data classification. However, a comprehensive knowledge acquisition method is crucial to support the end-to-end knowledge engineering process. In this paper, we introduce a knowledge acquisition system that covers all major phases of the cross-industry standard process for data mining. Acknowledging the importance of an end-to-end knowledge engineering process, we designed and developed an easy-to-use data-driven knowledge acquisition tool (DDKAT). The major features of the DDKAT are: (1) a novel unified features scoring approach for data selection; (2) a user-friendly data processing interface to improve the quality of the raw data; (3) an appropriate decision tree algorithm selection approach to build a classification model; and (4) the generation of production rules from various decision tree classification models in an automated manner. Furthermore, two diabetes studies were performed to assess the value of the DDKAT in terms of user experience. A total of 19 experts were involved in the first study and 102 students in the artificial intelligence domain were involved in the second study. The results showed that the overall user experience of the DDKAT was positive in terms of its attractiveness, as well as its pragmatic and hedonic quality factors.https://ieeexplore.ieee.org/document/8319403/Knowledge engineeringdata miningfeatures rankingalgorithm selectiondecision treeproduction rule |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Maqbool Ali Rahman Ali Wajahat Ali Khan Soyeon Caren Han Jaehun Bang Taeho Hur Dohyeong Kim Sungyoung Lee Byeong Ho Kang |
spellingShingle |
Maqbool Ali Rahman Ali Wajahat Ali Khan Soyeon Caren Han Jaehun Bang Taeho Hur Dohyeong Kim Sungyoung Lee Byeong Ho Kang A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules IEEE Access Knowledge engineering data mining features ranking algorithm selection decision tree production rule |
author_facet |
Maqbool Ali Rahman Ali Wajahat Ali Khan Soyeon Caren Han Jaehun Bang Taeho Hur Dohyeong Kim Sungyoung Lee Byeong Ho Kang |
author_sort |
Maqbool Ali |
title |
A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules |
title_short |
A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules |
title_full |
A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules |
title_fullStr |
A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules |
title_full_unstemmed |
A Data-Driven Knowledge Acquisition System: An End-to-End Knowledge Engineering Process for Generating Production Rules |
title_sort |
data-driven knowledge acquisition system: an end-to-end knowledge engineering process for generating production rules |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2018-01-01 |
description |
Data-driven knowledge acquisition is one of the key research fields in data mining. Dealing with large amounts of data has received a lot of attention in the field recently, and a number of methodologies have been proposed to extract insights from data in an automated or semi-automated manner. However, these methodologies generally target a specific aspect of the data mining process, such as data acquisition, data preprocessing, or data classification. However, a comprehensive knowledge acquisition method is crucial to support the end-to-end knowledge engineering process. In this paper, we introduce a knowledge acquisition system that covers all major phases of the cross-industry standard process for data mining. Acknowledging the importance of an end-to-end knowledge engineering process, we designed and developed an easy-to-use data-driven knowledge acquisition tool (DDKAT). The major features of the DDKAT are: (1) a novel unified features scoring approach for data selection; (2) a user-friendly data processing interface to improve the quality of the raw data; (3) an appropriate decision tree algorithm selection approach to build a classification model; and (4) the generation of production rules from various decision tree classification models in an automated manner. Furthermore, two diabetes studies were performed to assess the value of the DDKAT in terms of user experience. A total of 19 experts were involved in the first study and 102 students in the artificial intelligence domain were involved in the second study. The results showed that the overall user experience of the DDKAT was positive in terms of its attractiveness, as well as its pragmatic and hedonic quality factors. |
topic |
Knowledge engineering data mining features ranking algorithm selection decision tree production rule |
url |
https://ieeexplore.ieee.org/document/8319403/ |
work_keys_str_mv |
AT maqboolali adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT rahmanali adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT wajahatalikhan adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT soyeoncarenhan adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT jaehunbang adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT taehohur adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT dohyeongkim adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT sungyounglee adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT byeonghokang adatadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT maqboolali datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT rahmanali datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT wajahatalikhan datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT soyeoncarenhan datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT jaehunbang datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT taehohur datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT dohyeongkim datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT sungyounglee datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules AT byeonghokang datadrivenknowledgeacquisitionsystemanendtoendknowledgeengineeringprocessforgeneratingproductionrules |
_version_ |
1724194192243032064 |