P2P Flow Identification by Ensemble Classification
碩士 === 國立臺灣科技大學 === 電子工程系 === 97 === Peer-to-peer (P2P) traffic has accounted for major fraction of all internet traffic. Hence, P2P flow identification becomes an important problem for network management. A robust P2P flow identification approach should operate properly without port information and...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2009
|
Online Access: | http://ndltd.ncl.edu.tw/handle/21843436536868745208 |
id |
ndltd-TW-097NTUS5428175 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-097NTUS54281752016-05-02T04:11:48Z http://ndltd.ncl.edu.tw/handle/21843436536868745208 P2P Flow Identification by Ensemble Classification 以組合式分類器偵測點對點傳輸資料之研究 Cheng-Ru Wu 吳承儒 碩士 國立臺灣科技大學 電子工程系 97 Peer-to-peer (P2P) traffic has accounted for major fraction of all internet traffic. Hence, P2P flow identification becomes an important problem for network management. A robust P2P flow identification approach should operate properly without port information and payload information, since new-generation P2P applications can use arbitrary port number to avoid fixed-port block and use payload encryption to avoid P2P signature detection. Previous research that use machine learning approach for P2P flow identification, suffer form low detection rate and high false positive rate due to lack for proper features. In our research, we propose an ensemble classification approach, which integrates Hidden Markov Model (HMM) and Adaboost algorithm. The proposed P2P identification scheme can be divided into two stages. In the first stage, we investigated the phenomenon of small packet and large packet interchange in the P2P flow and identified an important feature, called packet size sequence pattern, and use Hidden Markov Model (HMM) to recognize the patterns. In the second stage, we use Adaboost algorithm with traditional flow attributes to promote the detection accuracy and reduce false positive in classification. To verify the performance of the proposed P2P identification based on ensemble classification, we collect network traffic traces from NTUST campus, and run intensive simulations. The simulation results show that the ensemble classification approach for P2P flow identification can achieve 98% detection rate and 5% false alarm rate. Yie-Tarng Chen 陳郁堂 2009 學位論文 ; thesis 55 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣科技大學 === 電子工程系 === 97 === Peer-to-peer (P2P) traffic has accounted for major fraction of all internet traffic. Hence, P2P flow identification becomes an important problem for network management. A robust P2P flow identification approach should operate properly without port information and payload information, since new-generation P2P applications can use arbitrary port number to avoid fixed-port block and use payload encryption to avoid P2P signature detection. Previous research that use machine learning approach for P2P flow identification, suffer form low detection rate and high false positive rate due to lack for proper features. In our research, we propose an ensemble classification approach, which integrates Hidden Markov Model (HMM) and Adaboost algorithm. The proposed P2P identification scheme can be divided into two stages. In the first stage, we investigated the phenomenon of small packet and large packet interchange in the P2P flow and identified an important feature, called packet size sequence pattern, and use Hidden Markov Model (HMM) to recognize the patterns. In the second stage, we use Adaboost algorithm with traditional flow attributes to promote the detection accuracy and reduce false positive in classification. To verify the performance of the proposed P2P identification based on ensemble classification, we collect network traffic traces from NTUST campus, and run intensive simulations. The simulation results show that the ensemble classification approach for P2P flow identification can achieve 98% detection rate and 5% false alarm rate.
|
author2 |
Yie-Tarng Chen |
author_facet |
Yie-Tarng Chen Cheng-Ru Wu 吳承儒 |
author |
Cheng-Ru Wu 吳承儒 |
spellingShingle |
Cheng-Ru Wu 吳承儒 P2P Flow Identification by Ensemble Classification |
author_sort |
Cheng-Ru Wu |
title |
P2P Flow Identification by Ensemble Classification |
title_short |
P2P Flow Identification by Ensemble Classification |
title_full |
P2P Flow Identification by Ensemble Classification |
title_fullStr |
P2P Flow Identification by Ensemble Classification |
title_full_unstemmed |
P2P Flow Identification by Ensemble Classification |
title_sort |
p2p flow identification by ensemble classification |
publishDate |
2009 |
url |
http://ndltd.ncl.edu.tw/handle/21843436536868745208 |
work_keys_str_mv |
AT chengruwu p2pflowidentificationbyensembleclassification AT wúchéngrú p2pflowidentificationbyensembleclassification AT chengruwu yǐzǔhéshìfēnlèiqìzhēncèdiǎnduìdiǎnchuánshūzīliàozhīyánjiū AT wúchéngrú yǐzǔhéshìfēnlèiqìzhēncèdiǎnduìdiǎnchuánshūzīliàozhīyánjiū |
_version_ |
1718254874472218624 |