P2P Flow Identification by Ensemble Classification

碩士 === 國立臺灣科技大學 === 電子工程系 === 97 === Peer-to-peer (P2P) traffic has accounted for major fraction of all internet traffic. Hence, P2P flow identification becomes an important problem for network management. A robust P2P flow identification approach should operate properly without port information and...

Full description

Bibliographic Details
Main Authors: Cheng-Ru Wu, 吳承儒
Other Authors: Yie-Tarng Chen
Format: Others
Language:en_US
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/21843436536868745208
id ndltd-TW-097NTUS5428175
record_format oai_dc
spelling ndltd-TW-097NTUS54281752016-05-02T04:11:48Z http://ndltd.ncl.edu.tw/handle/21843436536868745208 P2P Flow Identification by Ensemble Classification 以組合式分類器偵測點對點傳輸資料之研究 Cheng-Ru Wu 吳承儒 碩士 國立臺灣科技大學 電子工程系 97 Peer-to-peer (P2P) traffic has accounted for major fraction of all internet traffic. Hence, P2P flow identification becomes an important problem for network management. A robust P2P flow identification approach should operate properly without port information and payload information, since new-generation P2P applications can use arbitrary port number to avoid fixed-port block and use payload encryption to avoid P2P signature detection. Previous research that use machine learning approach for P2P flow identification, suffer form low detection rate and high false positive rate due to lack for proper features. In our research, we propose an ensemble classification approach, which integrates Hidden Markov Model (HMM) and Adaboost algorithm. The proposed P2P identification scheme can be divided into two stages. In the first stage, we investigated the phenomenon of small packet and large packet interchange in the P2P flow and identified an important feature, called packet size sequence pattern, and use Hidden Markov Model (HMM) to recognize the patterns. In the second stage, we use Adaboost algorithm with traditional flow attributes to promote the detection accuracy and reduce false positive in classification. To verify the performance of the proposed P2P identification based on ensemble classification, we collect network traffic traces from NTUST campus, and run intensive simulations. The simulation results show that the ensemble classification approach for P2P flow identification can achieve 98% detection rate and 5% false alarm rate. Yie-Tarng Chen 陳郁堂 2009 學位論文 ; thesis 55 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 電子工程系 === 97 === Peer-to-peer (P2P) traffic has accounted for major fraction of all internet traffic. Hence, P2P flow identification becomes an important problem for network management. A robust P2P flow identification approach should operate properly without port information and payload information, since new-generation P2P applications can use arbitrary port number to avoid fixed-port block and use payload encryption to avoid P2P signature detection. Previous research that use machine learning approach for P2P flow identification, suffer form low detection rate and high false positive rate due to lack for proper features. In our research, we propose an ensemble classification approach, which integrates Hidden Markov Model (HMM) and Adaboost algorithm. The proposed P2P identification scheme can be divided into two stages. In the first stage, we investigated the phenomenon of small packet and large packet interchange in the P2P flow and identified an important feature, called packet size sequence pattern, and use Hidden Markov Model (HMM) to recognize the patterns. In the second stage, we use Adaboost algorithm with traditional flow attributes to promote the detection accuracy and reduce false positive in classification. To verify the performance of the proposed P2P identification based on ensemble classification, we collect network traffic traces from NTUST campus, and run intensive simulations. The simulation results show that the ensemble classification approach for P2P flow identification can achieve 98% detection rate and 5% false alarm rate.
author2 Yie-Tarng Chen
author_facet Yie-Tarng Chen
Cheng-Ru Wu
吳承儒
author Cheng-Ru Wu
吳承儒
spellingShingle Cheng-Ru Wu
吳承儒
P2P Flow Identification by Ensemble Classification
author_sort Cheng-Ru Wu
title P2P Flow Identification by Ensemble Classification
title_short P2P Flow Identification by Ensemble Classification
title_full P2P Flow Identification by Ensemble Classification
title_fullStr P2P Flow Identification by Ensemble Classification
title_full_unstemmed P2P Flow Identification by Ensemble Classification
title_sort p2p flow identification by ensemble classification
publishDate 2009
url http://ndltd.ncl.edu.tw/handle/21843436536868745208
work_keys_str_mv AT chengruwu p2pflowidentificationbyensembleclassification
AT wúchéngrú p2pflowidentificationbyensembleclassification
AT chengruwu yǐzǔhéshìfēnlèiqìzhēncèdiǎnduìdiǎnchuánshūzīliàozhīyánjiū
AT wúchéngrú yǐzǔhéshìfēnlèiqìzhēncèdiǎnduìdiǎnchuánshūzīliàozhīyánjiū
_version_ 1718254874472218624