Data mining for generalized association rules and privacy preservingData mining for generalized association rules and privacy preserving

博士 === 國立雲林科技大學 === 工程科技研究所博士班 === 99 === Data mining is an analysis method used to extract the unknown and latent information that hides in large dataset which has usable information. In the last few years the data mining model and method have long-term progress and the association rule mining is m...

Full description

Bibliographic Details
Main Authors: Chieh-Ming Wu, 吳界明
Other Authors: Yin-Fu Huang
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/85543535661382633122
Description
Summary:博士 === 國立雲林科技大學 === 工程科技研究所博士班 === 99 === Data mining is an analysis method used to extract the unknown and latent information that hides in large dataset which has usable information. In the last few years the data mining model and method have long-term progress and the association rule mining is most often applied. The association rule research focus on discussion how to discover single level association rule effectiveness in the large dataset. In the recent years more and more researchers start to study the problem of multiple level association rules that was advantageous in the knowledge economy modernized society. In accordance to the enterprise, it must utilize nimbly the more deeply and more detailed association rules to assist the superintendent to complete policy-making in the short time. For reach the above objective, this study proposed an efficient data structure, Frequent Closed Enumerable Table (FCET), to speed the generalized association rules mining. In the other aspect, as a result of enterprise globalization acceleration, many sensitive individual information collection, processing and application involve to the individual privacy protection law. In addition, databases managed by enterprises also largely grow up. The databases store many individual sensitive material and corporation secret information. If the database suffers non-suitable access, it leads the security problem. Moreover, it causes the company secret restricted data and the individual material to be disclosed. Once the problem is not careful processed, it would possibly reduce the competitiveness of enterprise. This study proposes an effective data structure which considers the privacy preserving in the mining process. In addition, it carries on the complete discussion from data mining and privacy the preserving related question. A greedy algorithm which considers the hiding cost was proposed here. The algorithm includes the sanitized procedure and exposed procedure protection of mechanism. Not only privacy preserving for public content but also useful information extraction are guarantee to reach. Moreover, after the sanitized processing, it achieves privacy preserving and knowledge extracting balanced effectively.