Stream-based Packet Processing on General Purpose GPU Architecture

碩士 === 中原大學 === 電子工程研究所 === 99 === GPU and other SIMD stream architecture have been used for accelerating packet processing applications. This thesis explores the parallel implementation of sketch-based network traffic change detection application on GPU, multi-core CPU, and Cell processor using Ope...

Full description

Bibliographic Details
Main Authors: Theophilus-Yohanis Hermanus, 魏特佑
Other Authors: Yu-Kuen Lai
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/18902792511191815664
id ndltd-TW-099CYCU5428051
record_format oai_dc
spelling ndltd-TW-099CYCU54280512015-10-13T20:23:26Z http://ndltd.ncl.edu.tw/handle/18902792511191815664 Stream-based Packet Processing on General Purpose GPU Architecture 於通用繪圖處理器架構實現川流式網路封包處理之研究與探討 Theophilus-Yohanis Hermanus 魏特佑 碩士 中原大學 電子工程研究所 99 GPU and other SIMD stream architecture have been used for accelerating packet processing applications. This thesis explores the parallel implementation of sketch-based network traffic change detection application on GPU, multi-core CPU, and Cell processor using OpenCL parallel programming framework. Due to parallel nature of sketch data structure, the sketch computations can be mapped to the OpenCL execution model on GPU, multi-core CPU, and Cell processor. The sketch data structure is mapped to buffer object in device's global memory and work-items are executed on these sketches in parallel. The experiment results on Radeon HD 5870 GPU show that the parallel implementation of these sketch operations can speedup the computation time compared to sequential CPU implementation. The hash computation and ESTIMATE operation achieved 15.3X and 9.1X speedup, respectively. Our kernel implementation can reached more than 50% (78.64 GB/s) peak memory bandwidth of the 5870 GPU. The results also show that GPU is suitable for the sketch computations from multi-monitor and the data transfer rate from CPU to GPU is more effective if more than one monitor is used. For 16 monitors, the transfer rate for transferring keys from CPU memory to buffer in GPU memory can reached 2.28 GB/s. On multi-core CPU and Cell processor, using the same kernels with GPU without any optimizations, compared to sequential CPU implementation, the ESTIMATE operation can achieved 5.7X and 5.83X speedup, respectively. Yu-Kuen Lai 賴裕昆 2011 學位論文 ; thesis 121 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 中原大學 === 電子工程研究所 === 99 === GPU and other SIMD stream architecture have been used for accelerating packet processing applications. This thesis explores the parallel implementation of sketch-based network traffic change detection application on GPU, multi-core CPU, and Cell processor using OpenCL parallel programming framework. Due to parallel nature of sketch data structure, the sketch computations can be mapped to the OpenCL execution model on GPU, multi-core CPU, and Cell processor. The sketch data structure is mapped to buffer object in device's global memory and work-items are executed on these sketches in parallel. The experiment results on Radeon HD 5870 GPU show that the parallel implementation of these sketch operations can speedup the computation time compared to sequential CPU implementation. The hash computation and ESTIMATE operation achieved 15.3X and 9.1X speedup, respectively. Our kernel implementation can reached more than 50% (78.64 GB/s) peak memory bandwidth of the 5870 GPU. The results also show that GPU is suitable for the sketch computations from multi-monitor and the data transfer rate from CPU to GPU is more effective if more than one monitor is used. For 16 monitors, the transfer rate for transferring keys from CPU memory to buffer in GPU memory can reached 2.28 GB/s. On multi-core CPU and Cell processor, using the same kernels with GPU without any optimizations, compared to sequential CPU implementation, the ESTIMATE operation can achieved 5.7X and 5.83X speedup, respectively.
author2 Yu-Kuen Lai
author_facet Yu-Kuen Lai
Theophilus-Yohanis Hermanus
魏特佑
author Theophilus-Yohanis Hermanus
魏特佑
spellingShingle Theophilus-Yohanis Hermanus
魏特佑
Stream-based Packet Processing on General Purpose GPU Architecture
author_sort Theophilus-Yohanis Hermanus
title Stream-based Packet Processing on General Purpose GPU Architecture
title_short Stream-based Packet Processing on General Purpose GPU Architecture
title_full Stream-based Packet Processing on General Purpose GPU Architecture
title_fullStr Stream-based Packet Processing on General Purpose GPU Architecture
title_full_unstemmed Stream-based Packet Processing on General Purpose GPU Architecture
title_sort stream-based packet processing on general purpose gpu architecture
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/18902792511191815664
work_keys_str_mv AT theophilusyohanishermanus streambasedpacketprocessingongeneralpurposegpuarchitecture
AT wèitèyòu streambasedpacketprocessingongeneralpurposegpuarchitecture
AT theophilusyohanishermanus yútōngyònghuìtúchùlǐqìjiàgòushíxiànchuānliúshìwǎnglùfēngbāochùlǐzhīyánjiūyǔtàntǎo
AT wèitèyòu yútōngyònghuìtúchùlǐqìjiàgòushíxiànchuānliúshìwǎnglùfēngbāochùlǐzhīyánjiūyǔtàntǎo
_version_ 1718047390501437440