Solving Hyperbolic PDEs using Accelerator Architectures
Accelerator architectures are used to accelerate the simulation of nonlinear hyperbolic PDEs. Three different architectures, a multicore CPU using threading, IBM’s Cell Processor, and Nvidia’s Tesla GPUs are investigated. Speed-ups of between 40-75× relative to a single CPU core in single precisi...
Main Author: | |
---|---|
Language: | en |
Published: |
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/10012/4518 |
id |
ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-4518 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-45182013-01-08T18:52:27ZRostrup, Scott2009-07-27T13:29:05Z2009-07-27T13:29:05Z2009-07-27T13:29:05Z2009-07-15http://hdl.handle.net/10012/4518Accelerator architectures are used to accelerate the simulation of nonlinear hyperbolic PDEs. Three different architectures, a multicore CPU using threading, IBM’s Cell Processor, and Nvidia’s Tesla GPUs are investigated. Speed-ups of between 40-75× relative to a single CPU core in single precision are obtained using the Cell processor and the GPU. The three implementations are extended to parallel computing clusters by making use of the Message Passing Interface (MPI). The resulting hybrid-parallel code is investigated for performance and scalability on both a GPU and Cell computing cluster.enGPUCell ProcessorHyperbolic PDEsHardware OptimizationSolving Hyperbolic PDEs using Accelerator ArchitecturesThesis or DissertationApplied MathematicsMaster of MathematicsApplied Mathematics |
collection |
NDLTD |
language |
en |
sources |
NDLTD |
topic |
GPU Cell Processor Hyperbolic PDEs Hardware Optimization Applied Mathematics |
spellingShingle |
GPU Cell Processor Hyperbolic PDEs Hardware Optimization Applied Mathematics Rostrup, Scott Solving Hyperbolic PDEs using Accelerator Architectures |
description |
Accelerator architectures are used to accelerate the
simulation of nonlinear hyperbolic PDEs. Three different architectures, a multicore
CPU using threading, IBM’s Cell Processor, and Nvidia’s Tesla GPUs are investigated. Speed-ups of between 40-75× relative to a single CPU core in single precision are obtained using the Cell processor and the GPU. The three implementations are extended to parallel computing clusters by making use
of the Message Passing Interface (MPI). The resulting hybrid-parallel code is investigated
for performance and scalability on both a GPU and Cell computing cluster. |
author |
Rostrup, Scott |
author_facet |
Rostrup, Scott |
author_sort |
Rostrup, Scott |
title |
Solving Hyperbolic PDEs using Accelerator Architectures |
title_short |
Solving Hyperbolic PDEs using Accelerator Architectures |
title_full |
Solving Hyperbolic PDEs using Accelerator Architectures |
title_fullStr |
Solving Hyperbolic PDEs using Accelerator Architectures |
title_full_unstemmed |
Solving Hyperbolic PDEs using Accelerator Architectures |
title_sort |
solving hyperbolic pdes using accelerator architectures |
publishDate |
2009 |
url |
http://hdl.handle.net/10012/4518 |
work_keys_str_mv |
AT rostrupscott solvinghyperbolicpdesusingacceleratorarchitectures |
_version_ |
1716573354946723840 |