Summary: | Master's === Tamkang University === Master's Program, Department of Computer Science and Information Engineering === 93 === Data preprocessing is an important step in web usage mining. In this paper, we discuss several major problems in data preprocessing and propose methods to help solve them.
In a web usage mining process, if the web structure analysis is not completed first, data preprocessing cannot truly be completed, which in turn seriously affects the accuracy of pattern discovery.
Therefore, in this paper, we use Stochastic Timed Petri Nets (STPN) and their reachability behavior, together with the web structure constructed by the web structure analysis, to support web content scope recognition and the path completion procedure.
|