Critical Evaluation of Validation Rules Automated Extraction from Data

The goal of this article is to critically evaluate a possibility of automatic extraction of such kind of rules which could be later used within a Data Quality Management process for validation of records newly incoming to Information System. For practical demonstration the 4FT-Miner procedure implem...

Full description

Bibliographic Details
Main Author: David Pejcoch
Format: Article
Language:English
Published: Czech Society of Systems Integration 2014-10-01
Series:Journal of Systems Integration
Subjects:
Online Access:http://www.si-journal.org/index.php/JSI/article/viewFile/212/162
id doaj-129a875685ed41109c8af114628cbc30
record_format Article
spelling doaj-129a875685ed41109c8af114628cbc302020-11-24T23:25:17ZengCzech Society of Systems IntegrationJournal of Systems Integration1804-27242014-10-01543246Critical Evaluation of Validation Rules Automated Extraction from Data David Pejcoch The goal of this article is to critically evaluate a possibility of automatic extraction of such kind of rules which could be later used within a Data Quality Management process for validation of records newly incoming to Information System. For practical demonstration the 4FT-Miner procedure implemented in LISpMiner System was chosen. A motivation for this task is the potential simplification of projects focused on Data Quality Management. Initially, this article is going to critically evaluate a possibility of fully automated extraction with the aim to identify strengths and weaknesses of this approach in comparison to its alternative, when at least some a priori knowledge is available. As a result of practical implementation, this article provides design of recommended process which would be used as a guideline for future projects. Also the question of how to store and maintain extracted rules and how to integrate them with existing tools supporting Data Quality Management is discussedhttp://www.si-journal.org/index.php/JSI/article/viewFile/212/162Data QualityData Validationvalidation rulesCanonical Data Model4FT-MinerGUHADroolsbusiness rules
collection DOAJ
language English
format Article
sources DOAJ
author David Pejcoch
spellingShingle David Pejcoch
Critical Evaluation of Validation Rules Automated Extraction from Data
Journal of Systems Integration
Data Quality
Data Validation
validation rules
Canonical Data Model
4FT-Miner
GUHA
Drools
business rules
author_facet David Pejcoch
author_sort David Pejcoch
title Critical Evaluation of Validation Rules Automated Extraction from Data
title_short Critical Evaluation of Validation Rules Automated Extraction from Data
title_full Critical Evaluation of Validation Rules Automated Extraction from Data
title_fullStr Critical Evaluation of Validation Rules Automated Extraction from Data
title_full_unstemmed Critical Evaluation of Validation Rules Automated Extraction from Data
title_sort critical evaluation of validation rules automated extraction from data
publisher Czech Society of Systems Integration
series Journal of Systems Integration
issn 1804-2724
publishDate 2014-10-01
description The goal of this article is to critically evaluate a possibility of automatic extraction of such kind of rules which could be later used within a Data Quality Management process for validation of records newly incoming to Information System. For practical demonstration the 4FT-Miner procedure implemented in LISpMiner System was chosen. A motivation for this task is the potential simplification of projects focused on Data Quality Management. Initially, this article is going to critically evaluate a possibility of fully automated extraction with the aim to identify strengths and weaknesses of this approach in comparison to its alternative, when at least some a priori knowledge is available. As a result of practical implementation, this article provides design of recommended process which would be used as a guideline for future projects. Also the question of how to store and maintain extracted rules and how to integrate them with existing tools supporting Data Quality Management is discussed
topic Data Quality
Data Validation
validation rules
Canonical Data Model
4FT-Miner
GUHA
Drools
business rules
url http://www.si-journal.org/index.php/JSI/article/viewFile/212/162
work_keys_str_mv AT davidpejcoch criticalevaluationofvalidationrulesautomatedextractionfromdata
_version_ 1725558269912547328