Improve Data Quality By Using Dependencies And Regular Expressions

The objective of this study has been to answer the question of finding ways to improve the quality of database. There exists a lot of problems of the data stored in the database, like missing or spelling errors. To deal with the dirty data in the database, this study adopts the conditional functiona...

Full description

Bibliographic Details
Main Author: Feng, Yuan
Format: Others
Language:English
Published: Mittuniversitetet, Avdelningen för informationssystem och -teknologi 2018
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-35620
id ndltd-UPSALLA1-oai-DiVA.org-miun-35620
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-miun-356202019-02-13T05:51:20ZImprove Data Quality By Using Dependencies And Regular ExpressionsengFeng, YuanMittuniversitetet, Avdelningen för informationssystem och -teknologi2018data cleaningdata qualitycondition functional dependencyregular expressionComputer SystemsDatorsystemThe objective of this study has been to answer the question of finding ways to improve the quality of database. There exists a lot of problems of the data stored in the database, like missing or spelling errors. To deal with the dirty data in the database, this study adopts the conditional functional dependencies and regular expressions to detect and correct data. Based on the former studies of data cleaning methods, this study considers the more complex conditions of database and combines the efficient algorithms to deal with the data. The study shows that by using these methods, the database’s quality can be improved and considering the complexity of time and space, there still has a lot of things to do to make the data cleaning process more efficiency. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-35620Local DT-V18-A2-004application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic data cleaning
data quality
condition functional dependency
regular expression
Computer Systems
Datorsystem
spellingShingle data cleaning
data quality
condition functional dependency
regular expression
Computer Systems
Datorsystem
Feng, Yuan
Improve Data Quality By Using Dependencies And Regular Expressions
description The objective of this study has been to answer the question of finding ways to improve the quality of database. There exists a lot of problems of the data stored in the database, like missing or spelling errors. To deal with the dirty data in the database, this study adopts the conditional functional dependencies and regular expressions to detect and correct data. Based on the former studies of data cleaning methods, this study considers the more complex conditions of database and combines the efficient algorithms to deal with the data. The study shows that by using these methods, the database’s quality can be improved and considering the complexity of time and space, there still has a lot of things to do to make the data cleaning process more efficiency.
author Feng, Yuan
author_facet Feng, Yuan
author_sort Feng, Yuan
title Improve Data Quality By Using Dependencies And Regular Expressions
title_short Improve Data Quality By Using Dependencies And Regular Expressions
title_full Improve Data Quality By Using Dependencies And Regular Expressions
title_fullStr Improve Data Quality By Using Dependencies And Regular Expressions
title_full_unstemmed Improve Data Quality By Using Dependencies And Regular Expressions
title_sort improve data quality by using dependencies and regular expressions
publisher Mittuniversitetet, Avdelningen för informationssystem och -teknologi
publishDate 2018
url http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-35620
work_keys_str_mv AT fengyuan improvedataqualitybyusingdependenciesandregularexpressions
_version_ 1718975906962210816