Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets
Knowledge mined from clinical data can be used for medical diagnosis and prognosis. By improving the quality of knowledge base, the efficiency of prediction of a knowledge-based system can be enhanced. Designing accurate and precise clinical decision support systems, which use the mined knowledge, i...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Zagreb Faculty of Electrical Engineering and Computing
2016-03-01
|
Series: | Journal of Computing and Information Technology |
Subjects: | |
Online Access: | http://cit.fer.hr/index.php/CIT/article/view/2500/2056 |
id |
doaj-6e1ad4807af545a68d90c85c12aca6c4 |
---|---|
record_format |
Article |
spelling |
doaj-6e1ad4807af545a68d90c85c12aca6c42020-11-24T20:41:17ZengUniversity of Zagreb Faculty of Electrical Engineering and ComputingJournal of Computing and Information Technology1330-11361846-39082016-03-01241657810.20532/cit.2016.1002500Knowledge-based Systems and Interestingness Measures: Analysis with Clinical DatasetsJabez J. Christopher0Khanna H. Nehemiah1Kannan Arputharaj2Ramanujan Computing Centre, Anna UniversityRamanujan Computing Centre, Anna UniversityDepartment of Information Science and Technology, Anna UniversityKnowledge mined from clinical data can be used for medical diagnosis and prognosis. By improving the quality of knowledge base, the efficiency of prediction of a knowledge-based system can be enhanced. Designing accurate and precise clinical decision support systems, which use the mined knowledge, is still a broad area of research. This work analyses the variation in classification accuracy for such knowledge-based systems using different rule lists. The purpose of this work is not to improve the prediction accuracy of a decision support system, but analyze the factors that influence the efficiency and design of the knowledge base in a rule-based decision support system. Three benchmark medical datasets are used. Rules are extracted using a supervised machine learning algorithm (PART). Each rule in the ruleset is validated using nine frequently used rule interestingness measures. After calculating the measure values, the rule lists are used for performance evaluation. Experimental results show variation in classification accuracy for different rule lists. Confidence and Laplace measures yield relatively superior accuracy: 81.188% for heart disease dataset and 78.255% for diabetes dataset. The accuracy of the knowledge-based prediction system is predominantly dependent on the organization of the ruleset. Rule length needs to be considered when deciding the rule ordering. Subset of a rule, or combination of rule elements, may form new rules and sometimes be a member of the rule list. Redundant rules should be eliminated. Prior knowledge about the domain will enable knowledge engineers to design a better knowledge base. http://cit.fer.hr/index.php/CIT/article/view/2500/2056knowledge basedecision support systemsrule-based classificationrule listinterestingeness measures |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jabez J. Christopher Khanna H. Nehemiah Kannan Arputharaj |
spellingShingle |
Jabez J. Christopher Khanna H. Nehemiah Kannan Arputharaj Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets Journal of Computing and Information Technology knowledge base decision support systems rule-based classification rule list interestingeness measures |
author_facet |
Jabez J. Christopher Khanna H. Nehemiah Kannan Arputharaj |
author_sort |
Jabez J. Christopher |
title |
Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets |
title_short |
Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets |
title_full |
Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets |
title_fullStr |
Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets |
title_full_unstemmed |
Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets |
title_sort |
knowledge-based systems and interestingness measures: analysis with clinical datasets |
publisher |
University of Zagreb Faculty of Electrical Engineering and Computing |
series |
Journal of Computing and Information Technology |
issn |
1330-1136 1846-3908 |
publishDate |
2016-03-01 |
description |
Knowledge mined from clinical data can be used for medical diagnosis and prognosis. By improving the quality of knowledge base, the efficiency of prediction of a knowledge-based system can be enhanced. Designing accurate and precise clinical decision support systems, which use the mined knowledge, is still a broad area of research. This work analyses the variation in classification accuracy for such knowledge-based systems using different rule lists. The purpose of this work is not to improve the prediction accuracy of a decision support system, but analyze the factors that influence the efficiency and design of the knowledge base in a rule-based decision support system. Three benchmark medical datasets are used. Rules are extracted using a supervised machine learning algorithm (PART). Each rule in the ruleset is validated using nine frequently used rule interestingness measures. After calculating the measure values, the rule lists are used for performance evaluation. Experimental results show variation in classification accuracy for different rule lists. Confidence and Laplace measures yield relatively superior accuracy: 81.188% for heart disease dataset and 78.255% for diabetes dataset. The accuracy of the knowledge-based prediction system is predominantly dependent on the organization of the ruleset. Rule length needs to be considered when deciding the rule ordering. Subset of a rule, or combination of rule elements, may form new rules and sometimes be a member of the rule list. Redundant rules should be eliminated. Prior knowledge about the domain will enable knowledge engineers to design a better knowledge base.
|
topic |
knowledge base decision support systems rule-based classification rule list interestingeness measures |
url |
http://cit.fer.hr/index.php/CIT/article/view/2500/2056 |
work_keys_str_mv |
AT jabezjchristopher knowledgebasedsystemsandinterestingnessmeasuresanalysiswithclinicaldatasets AT khannahnehemiah knowledgebasedsystemsandinterestingnessmeasuresanalysiswithclinicaldatasets AT kannanarputharaj knowledgebasedsystemsandinterestingnessmeasuresanalysiswithclinicaldatasets |
_version_ |
1716825708583452672 |