Anomaly Detection via Over-sampling Principal Component Analysis

碩士 === 國立臺灣科技大學 === 資訊工程系 === 97 === Outlier detection is an important issue in data mining and has been studied in different research areas. It can be used for detecting a small amount of deviated data. In this article, we use ``Leave One Out" procedure to check each individual point for the `...

Full description

Bibliographic Details
Main Authors: Zheng-Yi Lee, 李政益
Other Authors: Yuh-Jye Lee
Format: Others
Language:en_US
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/31159755782188347722
Description
Summary:碩士 === 國立臺灣科技大學 === 資訊工程系 === 97 === Outlier detection is an important issue in data mining and has been studied in different research areas. It can be used for detecting a small amount of deviated data. In this article, we use ``Leave One Out" procedure to check each individual point for the ``with or without" effect on the variation of principal directions. Based on this idea, an over-sampling principal component analysis outlier detection method is proposed for emphasizing the influence of an abnormal instance (or an outlier). Except for identifying the suspicious outliers, we also design an on-line anomaly detection to detect the new arriving anomaly. In addition, we also study the quick updating of the principal directions for an effective computation to satisfy the on-line detecting demand. Numerical experiments show that our proposed method is effective in computation time and anomaly detection.