Optimization of the Brillouin operator on the KNL architecture
Experiences with optimizing the matrix-times-vector application of the Brillouin operator on the Intel KNL processor are reported. Without adjustments to the memory layout, performance figures of 360 Gflop/s in single and 270 Gflop/s in double precision are observed. This is with Nc = 3 colors, Nv =...
Main Author: | Dürr, Stephan |
---|---|
Format: | Article |
Language: | English |
Published: | EDP Sciences, 2018-01-01 |
Series: | EPJ Web of Conferences |
Online Access: | https://doi.org/10.1051/epjconf/201817502001 |
Similar Items
- Software and DVFS Tuning for Performance and Energy-Efficiency on Intel KNL Processors
  by: Enrico Calore, et al.
  Published: (2018-06-01)
- The outer kinetochore protein KNL-1 contains a defined oligomerization domain in nematodes
  by: Kern, David Matthew, et al.
  Published: (2015)
- The RZZ complex requires the N-terminus of KNL1 to mediate optimal Mad1 kinetochore localization in human cells
  by: Gina V. Caldas, et al.
  Published: (2015-01-01)
- Distinct Roles of RZZ and Bub1-KNL1 in Mitotic Checkpoint Signaling and Kinetochore Expansion
  by: Rodriguez-Rodriguez, et al.
  Published: (2020)
- Bub1 positions Mad1 close to KNL1 MELT repeats to promote checkpoint signalling
  by: Gang Zhang, et al.
  Published: (2017-06-01)