A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.
In this study, we present a novel strategy to the method of finite elements (FEM) of linear elastic problems of very high resolution on graphic processing units (GPU). The approach exploits regularities in the system matrix that occur in regular hexahedral grids to achieve cache-friendly matrix-free...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2020-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0240813 |
id |
doaj-5df88e4842d64ef6839c8a468cdf891a |
---|---|
record_format |
Article |
spelling |
doaj-5df88e4842d64ef6839c8a468cdf891a2021-03-04T11:08:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-011510e024081310.1371/journal.pone.0240813A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.Christian SchlinkmannMichael RolandChristian WolffPatrick TrampertPhilipp SlusallekStefan DiebelsTim DahmenIn this study, we present a novel strategy to the method of finite elements (FEM) of linear elastic problems of very high resolution on graphic processing units (GPU). The approach exploits regularities in the system matrix that occur in regular hexahedral grids to achieve cache-friendly matrix-free FEM. The node-by-node method lies in the class of block-iterative Gauss-Seidel multigrid solvers. Our method significantly improves convergence times in cases where an ordered distribution of distinct materials is present in the dataset. The method was evaluated on three real world datasets: An aluminum-silicon (AlSi) alloy and a dual phase steel material sample, both captured by scanning electron tomography, and a clinical computed tomography (CT) scan of a tibia. The caching scheme leads to a speed-up factor of ×2-×4 compared to the same code without the caching scheme. Additionally, it facilitates the computation of high-resolution problems that cannot be computed otherwise due to memory consumption.https://doi.org/10.1371/journal.pone.0240813 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Christian Schlinkmann Michael Roland Christian Wolff Patrick Trampert Philipp Slusallek Stefan Diebels Tim Dahmen |
spellingShingle |
Christian Schlinkmann Michael Roland Christian Wolff Patrick Trampert Philipp Slusallek Stefan Diebels Tim Dahmen A GPU-based caching strategy for multi-material linear elastic FEM on regular grids. PLoS ONE |
author_facet |
Christian Schlinkmann Michael Roland Christian Wolff Patrick Trampert Philipp Slusallek Stefan Diebels Tim Dahmen |
author_sort |
Christian Schlinkmann |
title |
A GPU-based caching strategy for multi-material linear elastic FEM on regular grids. |
title_short |
A GPU-based caching strategy for multi-material linear elastic FEM on regular grids. |
title_full |
A GPU-based caching strategy for multi-material linear elastic FEM on regular grids. |
title_fullStr |
A GPU-based caching strategy for multi-material linear elastic FEM on regular grids. |
title_full_unstemmed |
A GPU-based caching strategy for multi-material linear elastic FEM on regular grids. |
title_sort |
gpu-based caching strategy for multi-material linear elastic fem on regular grids. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2020-01-01 |
description |
In this study, we present a novel strategy to the method of finite elements (FEM) of linear elastic problems of very high resolution on graphic processing units (GPU). The approach exploits regularities in the system matrix that occur in regular hexahedral grids to achieve cache-friendly matrix-free FEM. The node-by-node method lies in the class of block-iterative Gauss-Seidel multigrid solvers. Our method significantly improves convergence times in cases where an ordered distribution of distinct materials is present in the dataset. The method was evaluated on three real world datasets: An aluminum-silicon (AlSi) alloy and a dual phase steel material sample, both captured by scanning electron tomography, and a clinical computed tomography (CT) scan of a tibia. The caching scheme leads to a speed-up factor of ×2-×4 compared to the same code without the caching scheme. Additionally, it facilitates the computation of high-resolution problems that cannot be computed otherwise due to memory consumption. |
url |
https://doi.org/10.1371/journal.pone.0240813 |
work_keys_str_mv |
AT christianschlinkmann agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT michaelroland agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT christianwolff agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT patricktrampert agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT philippslusallek agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT stefandiebels agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT timdahmen agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT christianschlinkmann gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT michaelroland gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT christianwolff gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT patricktrampert gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT philippslusallek gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT stefandiebels gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids AT timdahmen gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids |
_version_ |
1714804872847556608 |