Stream-based Packet Processing on General Purpose GPU Architecture

Master's === Chung Yuan Christian University === Graduate Institute of Electronic Engineering === 99 === GPUs and other SIMD stream architectures have been used to accelerate packet processing applications. This thesis explores the parallel implementation of a sketch-based network traffic change detection application on a GPU, a multi-core CPU, and a Cell processor using Ope...

Bibliographic Details
Main Authors:	Theophilus-Yohanis Hermanus, 魏特佑
Other Authors:	Yu-Kuen Lai
Format:	Others
Language:	en_US
Published:	2011
Online Access:	http://ndltd.ncl.edu.tw/handle/18902792511191815664

id	ndltd-TW-099CYCU5428051
record_format	oai_dc
spelling	ndltd-TW-099CYCU5428051 2015-10-13T20:23:26Z http://ndltd.ncl.edu.tw/handle/18902792511191815664 Stream-based Packet Processing on General Purpose GPU Architecture 於通用繪圖處理器架構實現川流式網路封包處理之研究與探討 Theophilus-Yohanis Hermanus 魏特佑 Master's Chung Yuan Christian University Graduate Institute of Electronic Engineering 99 GPUs and other SIMD stream architectures have been used to accelerate packet processing applications. This thesis explores the parallel implementation of a sketch-based network traffic change detection application on a GPU, a multi-core CPU, and a Cell processor using the OpenCL parallel programming framework. Because of the parallel nature of the sketch data structure, the sketch computations can be mapped to the OpenCL execution model on all three platforms. The sketch data structure is mapped to a buffer object in the device's global memory, and work-items execute on these sketches in parallel. Experimental results on a Radeon HD 5870 GPU show that the parallel implementation of these sketch operations reduces computation time compared with a sequential CPU implementation. The hash computation and the ESTIMATE operation achieved 15.3X and 9.1X speedups, respectively. Our kernel implementation can reach more than 50% (78.64 GB/s) of the peak memory bandwidth of the 5870 GPU. The results also show that the GPU is suitable for sketch computations from multiple monitors, and that data transfer from CPU to GPU is more efficient when more than one monitor is used. For 16 monitors, the transfer rate for moving keys from CPU memory to a buffer in GPU memory can reach 2.28 GB/s. On the multi-core CPU and the Cell processor, using the same kernels as on the GPU without any optimizations, the ESTIMATE operation achieved 5.7X and 5.83X speedups, respectively, over the sequential CPU implementation. Yu-Kuen Lai 賴裕昆 2011 學位論文 ; thesis 121 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	Master's === Chung Yuan Christian University === Graduate Institute of Electronic Engineering === 99 === GPUs and other SIMD stream architectures have been used to accelerate packet processing applications. This thesis explores the parallel implementation of a sketch-based network traffic change detection application on a GPU, a multi-core CPU, and a Cell processor using the OpenCL parallel programming framework. Because of the parallel nature of the sketch data structure, the sketch computations can be mapped to the OpenCL execution model on all three platforms. The sketch data structure is mapped to a buffer object in the device's global memory, and work-items execute on these sketches in parallel. Experimental results on a Radeon HD 5870 GPU show that the parallel implementation of these sketch operations reduces computation time compared with a sequential CPU implementation. The hash computation and the ESTIMATE operation achieved 15.3X and 9.1X speedups, respectively. Our kernel implementation can reach more than 50% (78.64 GB/s) of the peak memory bandwidth of the 5870 GPU. The results also show that the GPU is suitable for sketch computations from multiple monitors, and that data transfer from CPU to GPU is more efficient when more than one monitor is used. For 16 monitors, the transfer rate for moving keys from CPU memory to a buffer in GPU memory can reach 2.28 GB/s. On the multi-core CPU and the Cell processor, using the same kernels as on the GPU without any optimizations, the ESTIMATE operation achieved 5.7X and 5.83X speedups, respectively, over the sequential CPU implementation.
author2	Yu-Kuen Lai
author_facet	Yu-Kuen Lai Theophilus-Yohanis Hermanus 魏特佑
author	Theophilus-Yohanis Hermanus 魏特佑
spellingShingle	Theophilus-Yohanis Hermanus 魏特佑 Stream-based Packet Processing on General Purpose GPU Architecture
author_sort	Theophilus-Yohanis Hermanus
title	Stream-based Packet Processing on General Purpose GPU Architecture
title_short	Stream-based Packet Processing on General Purpose GPU Architecture
title_full	Stream-based Packet Processing on General Purpose GPU Architecture
title_fullStr	Stream-based Packet Processing on General Purpose GPU Architecture
title_full_unstemmed	Stream-based Packet Processing on General Purpose GPU Architecture
title_sort	stream-based packet processing on general purpose gpu architecture
publishDate	2011
url	http://ndltd.ncl.edu.tw/handle/18902792511191815664
work_keys_str_mv	AT theophilusyohanishermanus streambasedpacketprocessingongeneralpurposegpuarchitecture AT wèitèyòu streambasedpacketprocessingongeneralpurposegpuarchitecture AT theophilusyohanishermanus yútōngyònghuìtúchùlǐqìjiàgòushíxiànchuānliúshìwǎnglùfēngbāochùlǐzhīyánjiūyǔtàntǎo AT wèitèyòu yútōngyònghuìtúchùlǐqìjiàgòushíxiànchuānliúshìwǎnglùfēngbāochùlǐzhīyánjiūyǔtàntǎo
_version_	1718047390501437440
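
Illustrative note: the abstract in this record mentions hash computation and an ESTIMATE operation on a sketch, but it does not say which sketch variant the thesis uses. The following is a minimal sequential sketch in Python, assuming a k-ary sketch (a table of H rows by K counters, with ESTIMATE taken as the median of per-row unbiased estimates). The class name `KarySketch`, the row and bucket counts, and the `(a * key + b) mod p` hash family are all hypothetical choices made for illustration, not details taken from the thesis.

```python
import random
from statistics import median

# Mersenne prime 2^31 - 1; part of the illustrative hash family (an assumption).
PRIME = 2_147_483_647


class KarySketch:
    """Hypothetical k-ary sketch: `rows` hash rows, each with `buckets` counters."""

    def __init__(self, rows=5, buckets=1024, seed=0):
        rng = random.Random(seed)
        self.rows = rows
        self.buckets = buckets
        self.table = [[0.0] * buckets for _ in range(rows)]
        # One (a, b) pair per row defines that row's hash function.
        self.params = [
            (rng.randrange(1, PRIME), rng.randrange(0, PRIME)) for _ in range(rows)
        ]
        self.total = 0.0  # running sum of all update values

    def _hash(self, row, key):
        a, b = self.params[row]
        return ((a * key + b) % PRIME) % self.buckets

    def update(self, key, value):
        """UPDATE: add `value` to one counter in every row."""
        for i in range(self.rows):
            self.table[i][self._hash(i, key)] += value
        self.total += value

    def estimate(self, key):
        """ESTIMATE: median over rows of the bias-corrected counter value."""
        k = self.buckets
        per_row = [
            (self.table[i][self._hash(i, key)] - self.total / k) / (1.0 - 1.0 / k)
            for i in range(self.rows)
        ]
        return median(per_row)
```

Under these assumptions, `update` and `estimate` touch each row independently, and separate keys can also be processed independently. That independence is the kind of parallelism the abstract describes mapping onto OpenCL work-items, with the counter table held in a device buffer. A usage example would be `s = KarySketch(); s.update(42, 100.0); s.estimate(42)`, where the integer key stands in for a packet field such as a source address.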

Similar Items