Using Shared Memory As A Cache In High Performance Cellular Automata Water Flow Simulations
Graphics processors (GPU -- Graphic Processor Units) recently have gained a lot of interest as an efficient platform for general-purpose computation. Cellular Automata approach which is inherently parallel gives the opportunity to implement high performance simulations. This paper presents how share...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
AGH University of Science and Technology Press
2013-01-01
|
Series: | Computer Science |
Subjects: | |
Online Access: | http://journals.agh.edu.pl/csci/article/download/69/570 |
id |
doaj-86c51a490caf4b2b8917a59070ecd344 |
---|---|
record_format |
Article |
spelling |
doaj-86c51a490caf4b2b8917a59070ecd3442020-11-24T23:46:15ZengAGH University of Science and Technology PressComputer Science1508-28062013-01-0114338510.7494/csci.2013.14.3.385Using Shared Memory As A Cache In High Performance Cellular Automata Water Flow SimulationsPaweł Topa0Paweł Młocek1AGH University of Science and Technology, al. Mickiewicza 30, Kraków, PolandAGH University of Science and Technology, al. Mickiewicza 30, Kraków, PolandGraphics processors (GPU -- Graphic Processor Units) recently have gained a lot of interest as an efficient platform for general-purpose computation. Cellular Automata approach which is inherently parallel gives the opportunity to implement high performance simulations. This paper presents how shared memory in GPU can be used to improve performance for Cellular Automata models. In our previous works, we proposed algorithms for Cellular Automata model that use only a GPU global memory. Using a profiling tool, we found bottlenecks in our approach. We introduce modifications that takes an advantage of fast shared memory. The modified algorithm is presented in details, and the results of profiling and performance test are demonstrated. Our unique achievement is comparing the efficiency of the same algorithm working with a global and shared memory.http://journals.agh.edu.pl/csci/article/download/69/570computer sciencecellular automatagpu computationmodelling physical phenomena |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Paweł Topa Paweł Młocek |
spellingShingle |
Paweł Topa Paweł Młocek Using Shared Memory As A Cache In High Performance Cellular Automata Water Flow Simulations Computer Science computer sciencecellular automata gpu computation modelling physical phenomena |
author_facet |
Paweł Topa Paweł Młocek |
author_sort |
Paweł Topa |
title |
Using Shared Memory As A Cache In High Performance Cellular Automata Water Flow Simulations |
title_short |
Using Shared Memory As A Cache In High Performance Cellular Automata Water Flow Simulations |
title_full |
Using Shared Memory As A Cache In High Performance Cellular Automata Water Flow Simulations |
title_fullStr |
Using Shared Memory As A Cache In High Performance Cellular Automata Water Flow Simulations |
title_full_unstemmed |
Using Shared Memory As A Cache In High Performance Cellular Automata Water Flow Simulations |
title_sort |
using shared memory as a cache in high performance cellular automata water flow simulations |
publisher |
AGH University of Science and Technology Press |
series |
Computer Science |
issn |
1508-2806 |
publishDate |
2013-01-01 |
description |
Graphics processors (GPU -- Graphic Processor Units) recently have gained a lot of interest as an efficient platform for general-purpose computation. Cellular Automata approach which is inherently parallel gives the opportunity to implement high performance simulations. This paper presents how shared memory in GPU can be used to improve performance for Cellular Automata models. In our previous works, we proposed algorithms for Cellular Automata model that use only a GPU global memory. Using a profiling tool, we found bottlenecks in our approach. We introduce modifications that takes an advantage of fast shared memory. The modified algorithm is presented in details, and the results of profiling and performance test are demonstrated. Our unique achievement is comparing the efficiency of the same algorithm working with a global and shared memory. |
topic |
computer sciencecellular automata gpu computation modelling physical phenomena |
url |
http://journals.agh.edu.pl/csci/article/download/69/570 |
work_keys_str_mv |
AT pawełtopa usingsharedmemoryasacacheinhighperformancecellularautomatawaterflowsimulations AT pawełmłocek usingsharedmemoryasacacheinhighperformancecellularautomatawaterflowsimulations |
_version_ |
1725494018052194304 |