Early Blocking and Bypassing for Accelerating Web Content Filtering

碩士 === 國立交通大學 === 資訊科學系 === 91 === Real-time content analysis is an important technique in Web content filtering. However, it may also suffer lower accuracy and longer processing time. In this work, we present two algorithms named early blocking and early bypassing based on the Naïve Bayes method to...

Full description

Bibliographic Details
Main Authors: Ming-Dao Liu, 劉明道
Other Authors: Ying-Dar Lin
Format: Others
Language:en_US
Published: 2003
Online Access:http://ndltd.ncl.edu.tw/handle/13074375671188416253
id ndltd-TW-091NCTU0394047
record_format oai_dc
spelling ndltd-TW-091NCTU03940472016-06-22T04:14:06Z http://ndltd.ncl.edu.tw/handle/13074375671188416253 Early Blocking and Bypassing for Accelerating Web Content Filtering 網頁內容過濾初期阻擋與通過之加速演算法 Ming-Dao Liu 劉明道 碩士 國立交通大學 資訊科學系 91 Real-time content analysis is an important technique in Web content filtering. However, it may also suffer lower accuracy and longer processing time. In this work, we present two algorithms named early blocking and early bypassing based on the Naïve Bayes method to accelerate the classification process. The former algorithm allows making the blocking decision as early as we have enough confidence that the Web document should belong to some forbidden category, while the latter helps to make the bypassing decision as soon as the document is considered as a normal one. Experiments performed on NetBSD 1.6 with Pentinum III 700 MHz CPU show our algorithms can improve the throughput over four times higher than the original Bayesian classifier, while the F1 measure shows that the accuracy remains fairly good: 92% in forbidden traffic and 96% in normal traffic. Ying-Dar Lin 林盈達 2003 學位論文 ; thesis 32 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 資訊科學系 === 91 === Real-time content analysis is an important technique in Web content filtering. However, it may also suffer lower accuracy and longer processing time. In this work, we present two algorithms named early blocking and early bypassing based on the Naïve Bayes method to accelerate the classification process. The former algorithm allows making the blocking decision as early as we have enough confidence that the Web document should belong to some forbidden category, while the latter helps to make the bypassing decision as soon as the document is considered as a normal one. Experiments performed on NetBSD 1.6 with Pentinum III 700 MHz CPU show our algorithms can improve the throughput over four times higher than the original Bayesian classifier, while the F1 measure shows that the accuracy remains fairly good: 92% in forbidden traffic and 96% in normal traffic.
author2 Ying-Dar Lin
author_facet Ying-Dar Lin
Ming-Dao Liu
劉明道
author Ming-Dao Liu
劉明道
spellingShingle Ming-Dao Liu
劉明道
Early Blocking and Bypassing for Accelerating Web Content Filtering
author_sort Ming-Dao Liu
title Early Blocking and Bypassing for Accelerating Web Content Filtering
title_short Early Blocking and Bypassing for Accelerating Web Content Filtering
title_full Early Blocking and Bypassing for Accelerating Web Content Filtering
title_fullStr Early Blocking and Bypassing for Accelerating Web Content Filtering
title_full_unstemmed Early Blocking and Bypassing for Accelerating Web Content Filtering
title_sort early blocking and bypassing for accelerating web content filtering
publishDate 2003
url http://ndltd.ncl.edu.tw/handle/13074375671188416253
work_keys_str_mv AT mingdaoliu earlyblockingandbypassingforacceleratingwebcontentfiltering
AT liúmíngdào earlyblockingandbypassingforacceleratingwebcontentfiltering
AT mingdaoliu wǎngyènèiróngguòlǜchūqīzǔdǎngyǔtōngguòzhījiāsùyǎnsuànfǎ
AT liúmíngdào wǎngyènèiróngguòlǜchūqīzǔdǎngyǔtōngguòzhījiāsùyǎnsuànfǎ
_version_ 1718315047223033856