A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose
碩士 === 國立臺灣師範大學 === 圖書資訊學研究所 === 98 === Researchers often use statistics from previous events to serve as a basis for analysis, but the acquired data usually has its problems, which in turn may reduce the efficiency of the researcher’s analysis or even create erroneous results. Libraries often analy...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/56971463191617260252 |
id |
ndltd-TW-098NTNU5447028 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098NTNU54470282015-10-13T18:35:09Z http://ndltd.ncl.edu.tw/handle/56971463191617260252 A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose 基於借閱目的之資料清理機制研究-以興趣目的為例 Chen Chien Chieh 陳建傑 碩士 國立臺灣師範大學 圖書資訊學研究所 98 Researchers often use statistics from previous events to serve as a basis for analysis, but the acquired data usually has its problems, which in turn may reduce the efficiency of the researcher’s analysis or even create erroneous results. Libraries often analyze the patron’s borrowing history in order to adjust and improve its services, but often does not consider the patron’s purpose behind borrowing his or her information from the library. Most patrons have several reasons behind their borrowings, and it is may create erroneous results if we don’t clean it before analyzing. In this paper we analyze the effectiveness of a heuristic data-cleaning approach to remove the areas of non-interest in the patron’s historical loan record. Meanwhile, we also use F-Measure analysis to evaluate the results in order to suggest suitable cleaning methods. In addition, personal cleaning processes for patrons is implemented by adjusting the parameters of the clean-up mechanisms. From the study results, the patron’s borrowing history cannot be easily cleaned based on interest purposes, but you can attempt to clean the data by the E-M algorithm using cluster analysis, and use the properties of third tier classification: number, loan date, and author. Using personal cleaning, it is concluded that adjustments in the parameters could produce more satisfying results. In addition, if use F-Measure, more interesting parts in the patron’s borrowing history, the cleaning process will be more difficult. Shieh Jiann Cherng 謝建成 2010 學位論文 ; thesis 46 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣師範大學 === 圖書資訊學研究所 === 98 === Researchers often use statistics from previous events to serve as a basis for analysis, but the acquired data usually has its problems, which in turn may reduce the efficiency of the researcher’s analysis or even create erroneous results. Libraries often analyze the patron’s borrowing history in order to adjust and improve its services, but often does not consider the patron’s purpose behind borrowing his or her information from the library. Most patrons have several reasons behind their borrowings, and it is may create erroneous results if we don’t clean it before analyzing.
In this paper we analyze the effectiveness of a heuristic data-cleaning approach to remove the areas of non-interest in the patron’s historical loan record. Meanwhile, we also use F-Measure analysis to evaluate the results in order to suggest suitable cleaning methods. In addition, personal cleaning processes for patrons is implemented by adjusting the parameters of the clean-up mechanisms.
From the study results, the patron’s borrowing history cannot be easily cleaned based on interest purposes, but you can attempt to clean the data by the E-M algorithm using cluster analysis, and use the properties of third tier classification: number, loan date, and author. Using personal cleaning, it is concluded that adjustments in the parameters could produce more satisfying results. In addition, if use F-Measure, more interesting parts in the patron’s borrowing history, the cleaning process will be more difficult.
|
author2 |
Shieh Jiann Cherng |
author_facet |
Shieh Jiann Cherng Chen Chien Chieh 陳建傑 |
author |
Chen Chien Chieh 陳建傑 |
spellingShingle |
Chen Chien Chieh 陳建傑 A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose |
author_sort |
Chen Chien Chieh |
title |
A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose |
title_short |
A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose |
title_full |
A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose |
title_fullStr |
A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose |
title_full_unstemmed |
A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose |
title_sort |
study of data cleaning mechanisms based on borrowing purposes -the case study of interesting purpose |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/56971463191617260252 |
work_keys_str_mv |
AT chenchienchieh astudyofdatacleaningmechanismsbasedonborrowingpurposesthecasestudyofinterestingpurpose AT chénjiànjié astudyofdatacleaningmechanismsbasedonborrowingpurposesthecasestudyofinterestingpurpose AT chenchienchieh jīyújièyuèmùdezhīzīliàoqīnglǐjīzhìyánjiūyǐxìngqùmùdewèilì AT chénjiànjié jīyújièyuèmùdezhīzīliàoqīnglǐjīzhìyánjiūyǐxìngqùmùdewèilì AT chenchienchieh studyofdatacleaningmechanismsbasedonborrowingpurposesthecasestudyofinterestingpurpose AT chénjiànjié studyofdatacleaningmechanismsbasedonborrowingpurposesthecasestudyofinterestingpurpose |
_version_ |
1718034241063747584 |