Improve Data Quality By Using Dependencies And Regular Expressions
The objective of this study has been to answer the question of finding ways to improve the quality of database. There exists a lot of problems of the data stored in the database, like missing or spelling errors. To deal with the dirty data in the database, this study adopts the conditional functiona...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
Mittuniversitetet, Avdelningen för informationssystem och -teknologi
2018
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-35620 |
id |
ndltd-UPSALLA1-oai-DiVA.org-miun-35620 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-miun-356202019-02-13T05:51:20ZImprove Data Quality By Using Dependencies And Regular ExpressionsengFeng, YuanMittuniversitetet, Avdelningen för informationssystem och -teknologi2018data cleaningdata qualitycondition functional dependencyregular expressionComputer SystemsDatorsystemThe objective of this study has been to answer the question of finding ways to improve the quality of database. There exists a lot of problems of the data stored in the database, like missing or spelling errors. To deal with the dirty data in the database, this study adopts the conditional functional dependencies and regular expressions to detect and correct data. Based on the former studies of data cleaning methods, this study considers the more complex conditions of database and combines the efficient algorithms to deal with the data. The study shows that by using these methods, the database’s quality can be improved and considering the complexity of time and space, there still has a lot of things to do to make the data cleaning process more efficiency. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-35620Local DT-V18-A2-004application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
data cleaning data quality condition functional dependency regular expression Computer Systems Datorsystem |
spellingShingle |
data cleaning data quality condition functional dependency regular expression Computer Systems Datorsystem Feng, Yuan Improve Data Quality By Using Dependencies And Regular Expressions |
description |
The objective of this study has been to answer the question of finding ways to improve the quality of database. There exists a lot of problems of the data stored in the database, like missing or spelling errors. To deal with the dirty data in the database, this study adopts the conditional functional dependencies and regular expressions to detect and correct data. Based on the former studies of data cleaning methods, this study considers the more complex conditions of database and combines the efficient algorithms to deal with the data. The study shows that by using these methods, the database’s quality can be improved and considering the complexity of time and space, there still has a lot of things to do to make the data cleaning process more efficiency. |
author |
Feng, Yuan |
author_facet |
Feng, Yuan |
author_sort |
Feng, Yuan |
title |
Improve Data Quality By Using Dependencies And Regular Expressions |
title_short |
Improve Data Quality By Using Dependencies And Regular Expressions |
title_full |
Improve Data Quality By Using Dependencies And Regular Expressions |
title_fullStr |
Improve Data Quality By Using Dependencies And Regular Expressions |
title_full_unstemmed |
Improve Data Quality By Using Dependencies And Regular Expressions |
title_sort |
improve data quality by using dependencies and regular expressions |
publisher |
Mittuniversitetet, Avdelningen för informationssystem och -teknologi |
publishDate |
2018 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-35620 |
work_keys_str_mv |
AT fengyuan improvedataqualitybyusingdependenciesandregularexpressions |
_version_ |
1718975906962210816 |