A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.

In this study, we present a novel strategy to the method of finite elements (FEM) of linear elastic problems of very high resolution on graphic processing units (GPU). The approach exploits regularities in the system matrix that occur in regular hexahedral grids to achieve cache-friendly matrix-free...

Full description

Bibliographic Details
Main Authors: Christian Schlinkmann, Michael Roland, Christian Wolff, Patrick Trampert, Philipp Slusallek, Stefan Diebels, Tim Dahmen
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0240813
id doaj-5df88e4842d64ef6839c8a468cdf891a
record_format Article
spelling doaj-5df88e4842d64ef6839c8a468cdf891a2021-03-04T11:08:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-011510e024081310.1371/journal.pone.0240813A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.Christian SchlinkmannMichael RolandChristian WolffPatrick TrampertPhilipp SlusallekStefan DiebelsTim DahmenIn this study, we present a novel strategy to the method of finite elements (FEM) of linear elastic problems of very high resolution on graphic processing units (GPU). The approach exploits regularities in the system matrix that occur in regular hexahedral grids to achieve cache-friendly matrix-free FEM. The node-by-node method lies in the class of block-iterative Gauss-Seidel multigrid solvers. Our method significantly improves convergence times in cases where an ordered distribution of distinct materials is present in the dataset. The method was evaluated on three real world datasets: An aluminum-silicon (AlSi) alloy and a dual phase steel material sample, both captured by scanning electron tomography, and a clinical computed tomography (CT) scan of a tibia. The caching scheme leads to a speed-up factor of ×2-×4 compared to the same code without the caching scheme. Additionally, it facilitates the computation of high-resolution problems that cannot be computed otherwise due to memory consumption.https://doi.org/10.1371/journal.pone.0240813
collection DOAJ
language English
format Article
sources DOAJ
author Christian Schlinkmann
Michael Roland
Christian Wolff
Patrick Trampert
Philipp Slusallek
Stefan Diebels
Tim Dahmen
spellingShingle Christian Schlinkmann
Michael Roland
Christian Wolff
Patrick Trampert
Philipp Slusallek
Stefan Diebels
Tim Dahmen
A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.
PLoS ONE
author_facet Christian Schlinkmann
Michael Roland
Christian Wolff
Patrick Trampert
Philipp Slusallek
Stefan Diebels
Tim Dahmen
author_sort Christian Schlinkmann
title A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.
title_short A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.
title_full A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.
title_fullStr A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.
title_full_unstemmed A GPU-based caching strategy for multi-material linear elastic FEM on regular grids.
title_sort gpu-based caching strategy for multi-material linear elastic fem on regular grids.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description In this study, we present a novel strategy to the method of finite elements (FEM) of linear elastic problems of very high resolution on graphic processing units (GPU). The approach exploits regularities in the system matrix that occur in regular hexahedral grids to achieve cache-friendly matrix-free FEM. The node-by-node method lies in the class of block-iterative Gauss-Seidel multigrid solvers. Our method significantly improves convergence times in cases where an ordered distribution of distinct materials is present in the dataset. The method was evaluated on three real world datasets: An aluminum-silicon (AlSi) alloy and a dual phase steel material sample, both captured by scanning electron tomography, and a clinical computed tomography (CT) scan of a tibia. The caching scheme leads to a speed-up factor of ×2-×4 compared to the same code without the caching scheme. Additionally, it facilitates the computation of high-resolution problems that cannot be computed otherwise due to memory consumption.
url https://doi.org/10.1371/journal.pone.0240813
work_keys_str_mv AT christianschlinkmann agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT michaelroland agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT christianwolff agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT patricktrampert agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT philippslusallek agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT stefandiebels agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT timdahmen agpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT christianschlinkmann gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT michaelroland gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT christianwolff gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT patricktrampert gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT philippslusallek gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT stefandiebels gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
AT timdahmen gpubasedcachingstrategyformultimateriallinearelasticfemonregulargrids
_version_ 1714804872847556608