Optimization of the Brillouin operator on the KNL architecture
Experiences with optimizing the matrix-times-vector application of the Brillouin operator on the Intel KNL processor are reported. Without adjustments to the memory layout, performance figures of 360 Gflop/s in single and 270 Gflop/s in double precision are observed. This is with Nc = 3 colors, Nv =...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
EDP Sciences
2018-01-01
|
Series: | EPJ Web of Conferences |
Online Access: | https://doi.org/10.1051/epjconf/201817502001 |