Critical Evaluation of Validation Rules Automated Extraction from Data
The goal of this article is to critically evaluate a possibility of automatic extraction of such kind of rules which could be later used within a Data Quality Management process for validation of records newly incoming to Information System. For practical demonstration the 4FT-Miner procedure implem...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Czech Society of Systems Integration
2014-10-01
|
Series: | Journal of Systems Integration |
Subjects: | |
Online Access: | http://www.si-journal.org/index.php/JSI/article/viewFile/212/162 |
id |
doaj-129a875685ed41109c8af114628cbc30 |
---|---|
record_format |
Article |
spelling |
doaj-129a875685ed41109c8af114628cbc302020-11-24T23:25:17ZengCzech Society of Systems IntegrationJournal of Systems Integration1804-27242014-10-01543246Critical Evaluation of Validation Rules Automated Extraction from Data David Pejcoch The goal of this article is to critically evaluate a possibility of automatic extraction of such kind of rules which could be later used within a Data Quality Management process for validation of records newly incoming to Information System. For practical demonstration the 4FT-Miner procedure implemented in LISpMiner System was chosen. A motivation for this task is the potential simplification of projects focused on Data Quality Management. Initially, this article is going to critically evaluate a possibility of fully automated extraction with the aim to identify strengths and weaknesses of this approach in comparison to its alternative, when at least some a priori knowledge is available. As a result of practical implementation, this article provides design of recommended process which would be used as a guideline for future projects. Also the question of how to store and maintain extracted rules and how to integrate them with existing tools supporting Data Quality Management is discussedhttp://www.si-journal.org/index.php/JSI/article/viewFile/212/162Data QualityData Validationvalidation rulesCanonical Data Model4FT-MinerGUHADroolsbusiness rules |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
David Pejcoch |
spellingShingle |
David Pejcoch Critical Evaluation of Validation Rules Automated Extraction from Data Journal of Systems Integration Data Quality Data Validation validation rules Canonical Data Model 4FT-Miner GUHA Drools business rules |
author_facet |
David Pejcoch |
author_sort |
David Pejcoch |
title |
Critical Evaluation of Validation Rules Automated Extraction from Data |
title_short |
Critical Evaluation of Validation Rules Automated Extraction from Data |
title_full |
Critical Evaluation of Validation Rules Automated Extraction from Data |
title_fullStr |
Critical Evaluation of Validation Rules Automated Extraction from Data |
title_full_unstemmed |
Critical Evaluation of Validation Rules Automated Extraction from Data |
title_sort |
critical evaluation of validation rules automated extraction from data |
publisher |
Czech Society of Systems Integration |
series |
Journal of Systems Integration |
issn |
1804-2724 |
publishDate |
2014-10-01 |
description |
The goal of this article is to critically evaluate a possibility of automatic extraction of such kind of rules which could be later used within a Data Quality Management process for validation of records newly incoming to Information System. For practical demonstration the 4FT-Miner procedure implemented in LISpMiner System was chosen. A motivation for this task is the potential simplification of projects focused on Data Quality Management. Initially, this article is going to critically evaluate a possibility of fully automated extraction with the aim to identify strengths and weaknesses of this approach in comparison to its alternative, when at least some a priori knowledge is available. As a result of practical implementation, this article provides design of recommended process which would be used as a guideline for future projects. Also the question of how to store and maintain extracted rules and how to integrate them with existing tools supporting Data Quality Management is discussed |
topic |
Data Quality Data Validation validation rules Canonical Data Model 4FT-Miner GUHA Drools business rules |
url |
http://www.si-journal.org/index.php/JSI/article/viewFile/212/162 |
work_keys_str_mv |
AT davidpejcoch criticalevaluationofvalidationrulesautomatedextractionfromdata |
_version_ |
1725558269912547328 |