A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose

碩士 === 國立臺灣師範大學 === 圖書資訊學研究所 === 98 === Researchers often use statistics from previous events to serve as a basis for analysis, but the acquired data usually has its problems, which in turn may reduce the efficiency of the researcher’s analysis or even create erroneous results. Libraries often analy...

Full description

Bibliographic Details
Main Authors: Chen Chien Chieh, 陳建傑
Other Authors: Shieh Jiann Cherng
Format: Others
Language:zh-TW
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/56971463191617260252
id ndltd-TW-098NTNU5447028
record_format oai_dc
spelling ndltd-TW-098NTNU54470282015-10-13T18:35:09Z http://ndltd.ncl.edu.tw/handle/56971463191617260252 A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose 基於借閱目的之資料清理機制研究-以興趣目的為例 Chen Chien Chieh 陳建傑 碩士 國立臺灣師範大學 圖書資訊學研究所 98 Researchers often use statistics from previous events to serve as a basis for analysis, but the acquired data usually has its problems, which in turn may reduce the efficiency of the researcher’s analysis or even create erroneous results. Libraries often analyze the patron’s borrowing history in order to adjust and improve its services, but often does not consider the patron’s purpose behind borrowing his or her information from the library. Most patrons have several reasons behind their borrowings, and it is may create erroneous results if we don’t clean it before analyzing. In this paper we analyze the effectiveness of a heuristic data-cleaning approach to remove the areas of non-interest in the patron’s historical loan record. Meanwhile, we also use F-Measure analysis to evaluate the results in order to suggest suitable cleaning methods. In addition, personal cleaning processes for patrons is implemented by adjusting the parameters of the clean-up mechanisms. From the study results, the patron’s borrowing history cannot be easily cleaned based on interest purposes, but you can attempt to clean the data by the E-M algorithm using cluster analysis, and use the properties of third tier classification: number, loan date, and author. Using personal cleaning, it is concluded that adjustments in the parameters could produce more satisfying results. In addition, if use F-Measure, more interesting parts in the patron’s borrowing history, the cleaning process will be more difficult. Shieh Jiann Cherng 謝建成 2010 學位論文 ; thesis 46 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣師範大學 === 圖書資訊學研究所 === 98 === Researchers often use statistics from previous events to serve as a basis for analysis, but the acquired data usually has its problems, which in turn may reduce the efficiency of the researcher’s analysis or even create erroneous results. Libraries often analyze the patron’s borrowing history in order to adjust and improve its services, but often does not consider the patron’s purpose behind borrowing his or her information from the library. Most patrons have several reasons behind their borrowings, and it is may create erroneous results if we don’t clean it before analyzing. In this paper we analyze the effectiveness of a heuristic data-cleaning approach to remove the areas of non-interest in the patron’s historical loan record. Meanwhile, we also use F-Measure analysis to evaluate the results in order to suggest suitable cleaning methods. In addition, personal cleaning processes for patrons is implemented by adjusting the parameters of the clean-up mechanisms. From the study results, the patron’s borrowing history cannot be easily cleaned based on interest purposes, but you can attempt to clean the data by the E-M algorithm using cluster analysis, and use the properties of third tier classification: number, loan date, and author. Using personal cleaning, it is concluded that adjustments in the parameters could produce more satisfying results. In addition, if use F-Measure, more interesting parts in the patron’s borrowing history, the cleaning process will be more difficult.
author2 Shieh Jiann Cherng
author_facet Shieh Jiann Cherng
Chen Chien Chieh
陳建傑
author Chen Chien Chieh
陳建傑
spellingShingle Chen Chien Chieh
陳建傑
A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose
author_sort Chen Chien Chieh
title A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose
title_short A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose
title_full A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose
title_fullStr A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose
title_full_unstemmed A Study of Data Cleaning Mechanisms Based on Borrowing Purposes -The Case Study of Interesting Purpose
title_sort study of data cleaning mechanisms based on borrowing purposes -the case study of interesting purpose
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/56971463191617260252
work_keys_str_mv AT chenchienchieh astudyofdatacleaningmechanismsbasedonborrowingpurposesthecasestudyofinterestingpurpose
AT chénjiànjié astudyofdatacleaningmechanismsbasedonborrowingpurposesthecasestudyofinterestingpurpose
AT chenchienchieh jīyújièyuèmùdezhīzīliàoqīnglǐjīzhìyánjiūyǐxìngqùmùdewèilì
AT chénjiànjié jīyújièyuèmùdezhīzīliàoqīnglǐjīzhìyánjiūyǐxìngqùmùdewèilì
AT chenchienchieh studyofdatacleaningmechanismsbasedonborrowingpurposesthecasestudyofinterestingpurpose
AT chénjiànjié studyofdatacleaningmechanismsbasedonborrowingpurposesthecasestudyofinterestingpurpose
_version_ 1718034241063747584