Early Blocking and Bypassing for Accelerating Web Content Filtering
碩士 === 國立交通大學 === 資訊科學系 === 91 === Real-time content analysis is an important technique in Web content filtering. However, it may also suffer lower accuracy and longer processing time. In this work, we present two algorithms named early blocking and early bypassing based on the Naïve Bayes method to...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2003
|
Online Access: | http://ndltd.ncl.edu.tw/handle/13074375671188416253 |
id |
ndltd-TW-091NCTU0394047 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-091NCTU03940472016-06-22T04:14:06Z http://ndltd.ncl.edu.tw/handle/13074375671188416253 Early Blocking and Bypassing for Accelerating Web Content Filtering 網頁內容過濾初期阻擋與通過之加速演算法 Ming-Dao Liu 劉明道 碩士 國立交通大學 資訊科學系 91 Real-time content analysis is an important technique in Web content filtering. However, it may also suffer lower accuracy and longer processing time. In this work, we present two algorithms named early blocking and early bypassing based on the Naïve Bayes method to accelerate the classification process. The former algorithm allows making the blocking decision as early as we have enough confidence that the Web document should belong to some forbidden category, while the latter helps to make the bypassing decision as soon as the document is considered as a normal one. Experiments performed on NetBSD 1.6 with Pentinum III 700 MHz CPU show our algorithms can improve the throughput over four times higher than the original Bayesian classifier, while the F1 measure shows that the accuracy remains fairly good: 92% in forbidden traffic and 96% in normal traffic. Ying-Dar Lin 林盈達 2003 學位論文 ; thesis 32 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 資訊科學系 === 91 === Real-time content analysis is an important technique in Web content filtering. However, it may also suffer lower accuracy and longer processing time. In this work, we present two algorithms named early blocking and early bypassing based on the Naïve Bayes method to accelerate the classification process. The former algorithm allows making the blocking decision as early as we have enough confidence that the Web document should belong to some forbidden category, while the latter helps to make the bypassing decision as soon as the document is considered as a normal one. Experiments performed on NetBSD 1.6 with Pentinum III 700 MHz CPU show our algorithms can improve the throughput over four times higher than the original Bayesian classifier, while the F1 measure shows that the accuracy remains fairly good: 92% in forbidden traffic and 96% in normal traffic.
|
author2 |
Ying-Dar Lin |
author_facet |
Ying-Dar Lin Ming-Dao Liu 劉明道 |
author |
Ming-Dao Liu 劉明道 |
spellingShingle |
Ming-Dao Liu 劉明道 Early Blocking and Bypassing for Accelerating Web Content Filtering |
author_sort |
Ming-Dao Liu |
title |
Early Blocking and Bypassing for Accelerating Web Content Filtering |
title_short |
Early Blocking and Bypassing for Accelerating Web Content Filtering |
title_full |
Early Blocking and Bypassing for Accelerating Web Content Filtering |
title_fullStr |
Early Blocking and Bypassing for Accelerating Web Content Filtering |
title_full_unstemmed |
Early Blocking and Bypassing for Accelerating Web Content Filtering |
title_sort |
early blocking and bypassing for accelerating web content filtering |
publishDate |
2003 |
url |
http://ndltd.ncl.edu.tw/handle/13074375671188416253 |
work_keys_str_mv |
AT mingdaoliu earlyblockingandbypassingforacceleratingwebcontentfiltering AT liúmíngdào earlyblockingandbypassingforacceleratingwebcontentfiltering AT mingdaoliu wǎngyènèiróngguòlǜchūqīzǔdǎngyǔtōngguòzhījiāsùyǎnsuànfǎ AT liúmíngdào wǎngyènèiróngguòlǜchūqīzǔdǎngyǔtōngguòzhījiāsùyǎnsuànfǎ |
_version_ |
1718315047223033856 |