OPTIMIZING LEMPEL-ZIV FACTORIZATION FOR THE GPU ARCHITECTURE

Lossless data compression is used to reduce storage requirements, allowing for the relief of I/O channels and better utilization of bandwidth. The Lempel-Ziv lossless compression algorithms form the basis for many of the most commonly used compression schemes. General purpose computing on graphic pr...

Full description

Bibliographic Details
Main Author: Ching, Bryan
Format: Others
Published: DigitalCommons@CalPoly 2014
Subjects:
GPU
Online Access:https://digitalcommons.calpoly.edu/theses/1238
https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=2361&context=theses
id ndltd-CALPOLY-oai-digitalcommons.calpoly.edu-theses-2361
record_format oai_dc
spelling ndltd-CALPOLY-oai-digitalcommons.calpoly.edu-theses-23612021-09-14T05:01:36Z OPTIMIZING LEMPEL-ZIV FACTORIZATION FOR THE GPU ARCHITECTURE Ching, Bryan Lossless data compression is used to reduce storage requirements, allowing for the relief of I/O channels and better utilization of bandwidth. The Lempel-Ziv lossless compression algorithms form the basis for many of the most commonly used compression schemes. General purpose computing on graphic processing units (GPGPUs) allows us to take advantage of the massively parallel nature of GPUs for computations other that their original purpose of rendering graphics. Our work targets the use of GPUs for general lossless data compression. Specifically, we developed and ported an algorithm that constructs the Lempel-Ziv factorization directly on the GPU. Our implementation bypasses the sequential nature of the LZ factorization and attempts to compute the factorization in parallel. By breaking down the LZ factorization into what we call the PLZ, we are able to outperform the fastest serial CPU implementations by up to 24x and perform comparatively to a parallel multicore CPU implementation. To achieve these speeds, our implementation outputted LZ factorizations that were on average only 0.01 percent greater than the optimal solution that what could be computed sequentially. We are also able to reevaluate the fastest GPU suffix array construction algorithm, which is needed to compute the LZ factorization. We are able to find speedups of up to 5x over the fastest CPU implementations. 2014-06-01T07:00:00Z text application/pdf https://digitalcommons.calpoly.edu/theses/1238 https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=2361&context=theses Master's Theses DigitalCommons@CalPoly GPU CUDA Parallelism Compression LZ77 Lempel-Ziv Computer and Systems Architecture
collection NDLTD
format Others
sources NDLTD
topic GPU
CUDA
Parallelism
Compression
LZ77
Lempel-Ziv
Computer and Systems Architecture
spellingShingle GPU
CUDA
Parallelism
Compression
LZ77
Lempel-Ziv
Computer and Systems Architecture
Ching, Bryan
OPTIMIZING LEMPEL-ZIV FACTORIZATION FOR THE GPU ARCHITECTURE
description Lossless data compression is used to reduce storage requirements, allowing for the relief of I/O channels and better utilization of bandwidth. The Lempel-Ziv lossless compression algorithms form the basis for many of the most commonly used compression schemes. General purpose computing on graphic processing units (GPGPUs) allows us to take advantage of the massively parallel nature of GPUs for computations other that their original purpose of rendering graphics. Our work targets the use of GPUs for general lossless data compression. Specifically, we developed and ported an algorithm that constructs the Lempel-Ziv factorization directly on the GPU. Our implementation bypasses the sequential nature of the LZ factorization and attempts to compute the factorization in parallel. By breaking down the LZ factorization into what we call the PLZ, we are able to outperform the fastest serial CPU implementations by up to 24x and perform comparatively to a parallel multicore CPU implementation. To achieve these speeds, our implementation outputted LZ factorizations that were on average only 0.01 percent greater than the optimal solution that what could be computed sequentially. We are also able to reevaluate the fastest GPU suffix array construction algorithm, which is needed to compute the LZ factorization. We are able to find speedups of up to 5x over the fastest CPU implementations.
author Ching, Bryan
author_facet Ching, Bryan
author_sort Ching, Bryan
title OPTIMIZING LEMPEL-ZIV FACTORIZATION FOR THE GPU ARCHITECTURE
title_short OPTIMIZING LEMPEL-ZIV FACTORIZATION FOR THE GPU ARCHITECTURE
title_full OPTIMIZING LEMPEL-ZIV FACTORIZATION FOR THE GPU ARCHITECTURE
title_fullStr OPTIMIZING LEMPEL-ZIV FACTORIZATION FOR THE GPU ARCHITECTURE
title_full_unstemmed OPTIMIZING LEMPEL-ZIV FACTORIZATION FOR THE GPU ARCHITECTURE
title_sort optimizing lempel-ziv factorization for the gpu architecture
publisher DigitalCommons@CalPoly
publishDate 2014
url https://digitalcommons.calpoly.edu/theses/1238
https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=2361&context=theses
work_keys_str_mv AT chingbryan optimizinglempelzivfactorizationforthegpuarchitecture
_version_ 1719480489611362304