A Binary Translator to Accelerate Development of Deep Learning Processing Library for AArch64 CPU
To accelerate deep learning (DL) processes on the supercomputer Fugaku, the authors have ported and optimized oneDNN for Fugaku's CPU, the Fujitsu A64FX. oneDNN is an open-source DL processing library developed by Intel for the x86 64 architecture. The A64FX CPU is based on the Armv8-A architec...
Main Authors: | Fukumoto, N. (Author), Honda, T. (Author), Kawakami, K. (Author), Kurihara, K. (Author), Yamazaki, M. (Author) |
---|---|
Format: | Article |
Language: | English |
Published: |
Institute of Electronics Information Communication Engineers
2022
|
Subjects: | |
Online Access: | View Fulltext in Publisher |
Similar Items
-
Fine-Grained Isolation to Protect Data against In-Process Attacks on AArch64
by: Yeongpil Cho
Published: (2020-02-01) -
An ECMA-55 Minimal BASIC Compiler for x86-64 Linux®
by: John Gatewood Ham
Published: (2014-10-01) -
More Accurate Differential Properties of LED64 and Midori64
by: Ling Sun, et al.
Published: (2018-09-01) -
Geometría Y Espacio-DI64-201202
by: Rozas Schmitt Cecilia, et al.
Published: (2020) -
Atelier De La Creatividad-DM64-201201
by: Pilo Pais Figallo Natalia Benjamina
Published: (2020)