Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets

Knowledge mined from clinical data can be used for medical diagnosis and prognosis. By improving the quality of knowledge base, the efficiency of prediction of a knowledge-based system can be enhanced. Designing accurate and precise clinical decision support systems, which use the mined knowledge, i...

Full description

Bibliographic Details
Main Authors: Jabez J. Christopher, Khanna H. Nehemiah, Kannan Arputharaj
Format: Article
Language:English
Published: University of Zagreb Faculty of Electrical Engineering and Computing 2016-03-01
Series:Journal of Computing and Information Technology
Subjects:
Online Access:http://cit.fer.hr/index.php/CIT/article/view/2500/2056
id doaj-6e1ad4807af545a68d90c85c12aca6c4
record_format Article
spelling doaj-6e1ad4807af545a68d90c85c12aca6c42020-11-24T20:41:17ZengUniversity of Zagreb Faculty of Electrical Engineering and ComputingJournal of Computing and Information Technology1330-11361846-39082016-03-01241657810.20532/cit.2016.1002500Knowledge-based Systems and Interestingness Measures: Analysis with Clinical DatasetsJabez J. Christopher0Khanna H. Nehemiah1Kannan Arputharaj2Ramanujan Computing Centre, Anna UniversityRamanujan Computing Centre, Anna UniversityDepartment of Information Science and Technology, Anna UniversityKnowledge mined from clinical data can be used for medical diagnosis and prognosis. By improving the quality of knowledge base, the efficiency of prediction of a knowledge-based system can be enhanced. Designing accurate and precise clinical decision support systems, which use the mined knowledge, is still a broad area of research. This work analyses the variation in classification accuracy for such knowledge-based systems using different rule lists. The purpose of this work is not to improve the prediction accuracy of a decision support system, but analyze the factors that influence the efficiency and design of the knowledge base in a rule-based decision support system. Three benchmark medical datasets are used. Rules are extracted using a supervised machine learning algorithm (PART). Each rule in the ruleset is validated using nine frequently used rule interestingness measures. After calculating the measure values, the rule lists are used for performance evaluation. Experimental results show variation in classification accuracy for different rule lists. Confidence and Laplace measures yield relatively superior accuracy: 81.188% for heart disease dataset and 78.255% for diabetes dataset. The accuracy of the knowledge-based prediction system is predominantly dependent on the organization of the ruleset. Rule length needs to be considered when deciding the rule ordering. Subset of a rule, or combination of rule elements, may form new rules and sometimes be a member of the rule list. Redundant rules should be eliminated. Prior knowledge about the domain will enable knowledge engineers to design a better knowledge base. http://cit.fer.hr/index.php/CIT/article/view/2500/2056knowledge basedecision support systemsrule-based classificationrule listinterestingeness measures
collection DOAJ
language English
format Article
sources DOAJ
author Jabez J. Christopher
Khanna H. Nehemiah
Kannan Arputharaj
spellingShingle Jabez J. Christopher
Khanna H. Nehemiah
Kannan Arputharaj
Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets
Journal of Computing and Information Technology
knowledge base
decision support systems
rule-based classification
rule list
interestingeness measures
author_facet Jabez J. Christopher
Khanna H. Nehemiah
Kannan Arputharaj
author_sort Jabez J. Christopher
title Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets
title_short Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets
title_full Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets
title_fullStr Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets
title_full_unstemmed Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets
title_sort knowledge-based systems and interestingness measures: analysis with clinical datasets
publisher University of Zagreb Faculty of Electrical Engineering and Computing
series Journal of Computing and Information Technology
issn 1330-1136
1846-3908
publishDate 2016-03-01
description Knowledge mined from clinical data can be used for medical diagnosis and prognosis. By improving the quality of knowledge base, the efficiency of prediction of a knowledge-based system can be enhanced. Designing accurate and precise clinical decision support systems, which use the mined knowledge, is still a broad area of research. This work analyses the variation in classification accuracy for such knowledge-based systems using different rule lists. The purpose of this work is not to improve the prediction accuracy of a decision support system, but analyze the factors that influence the efficiency and design of the knowledge base in a rule-based decision support system. Three benchmark medical datasets are used. Rules are extracted using a supervised machine learning algorithm (PART). Each rule in the ruleset is validated using nine frequently used rule interestingness measures. After calculating the measure values, the rule lists are used for performance evaluation. Experimental results show variation in classification accuracy for different rule lists. Confidence and Laplace measures yield relatively superior accuracy: 81.188% for heart disease dataset and 78.255% for diabetes dataset. The accuracy of the knowledge-based prediction system is predominantly dependent on the organization of the ruleset. Rule length needs to be considered when deciding the rule ordering. Subset of a rule, or combination of rule elements, may form new rules and sometimes be a member of the rule list. Redundant rules should be eliminated. Prior knowledge about the domain will enable knowledge engineers to design a better knowledge base.
topic knowledge base
decision support systems
rule-based classification
rule list
interestingeness measures
url http://cit.fer.hr/index.php/CIT/article/view/2500/2056
work_keys_str_mv AT jabezjchristopher knowledgebasedsystemsandinterestingnessmeasuresanalysiswithclinicaldatasets
AT khannahnehemiah knowledgebasedsystemsandinterestingnessmeasuresanalysiswithclinicaldatasets
AT kannanarputharaj knowledgebasedsystemsandinterestingnessmeasuresanalysiswithclinicaldatasets
_version_ 1716825708583452672