Optimization of the Brillouin operator on the KNL architecture

Experiences with optimizing the matrix-times-vector application of the Brillouin operator on the Intel KNL processor are reported. Without adjustments to the memory layout, performance figures of 360 Gflop/s in single and 270 Gflop/s in double precision are observed. This is with Nc = 3 colors, Nv =...

Bibliographic Details
Main Author: Dürr, Stephan
Format: Article
Language: English
Published: EDP Sciences 2018-01-01
Series: EPJ Web of Conferences
Online Access: https://doi.org/10.1051/epjconf/201817502001