A Binary Translator to Accelerate Development of Deep Learning Processing Library for AArch64 CPU

A Binary Translator to Accelerate Development of Deep Learning Processing Library for AArch64 CPU

To accelerate deep learning (DL) processes on the supercomputer Fugaku, the authors have ported and optimized oneDNN for Fugaku's CPU, the Fujitsu A64FX. oneDNN is an open-source DL processing library developed by Intel for the x86 64 architecture. The A64FX CPU is based on the Armv8-A architec...

Full description

Bibliographic Details
Main Authors:	Fukumoto, N. (Author), Honda, T. (Author), Kawakami, K. (Author), Kurihara, K. (Author), Yamazaki, M. (Author)
Format:	Article
Language:	English
Published:	Institute of Electronics Information Communication Engineers 2022
Subjects:	AArch64 binary translator deep learning just-in-time assembler oneDNN
Online Access:	View Fulltext in Publisher

Similar Items

Fine-Grained Isolation to Protect Data against In-Process Attacks on AArch64
by: Yeongpil Cho
Published: (2020-02-01)

An ECMA-55 Minimal BASIC Compiler for x86-64 Linux®
by: John Gatewood Ham
Published: (2014-10-01)

More Accurate Differential Properties of LED64 and Midori64
by: Ling Sun, et al.
Published: (2018-09-01)

Geometría Y Espacio-DI64-201202
by: Rozas Schmitt Cecilia, et al.
Published: (2020)

Atelier De La Creatividad-DM64-201201
by: Pilo Pais Figallo Natalia Benjamina
Published: (2020)

Atelier De La Creatividad-DM64-201202
by: Pilo Pais Figallo Natalia Benjamina
Published: (2020)

Atelier De La Creatividad-DM64-201301
by: Pilo Pais Figallo Natalia Benjamina
Published: (2020)

Atelier De La Creatividad-DM64-201302
by: Pilo Pais Figallo Natalia Benjamina
Published: (2020)

Atelier De La Creatividad-DM64-201401
by: Pilo Pais Figallo Natalia Benjamina
Published: (2020)

Atelier De La Creatividad-DM64-201402
by: Pilo Pais Figallo Natalia Benjamina
Published: (2020)

Reducing metal contamination in Cu-64 production
by: Poniger, S., et al.
Published: (2015)

Separation of Radiocopper 64/67Cu from the Matrix of Neutron-Irradiated Natural Zinc Applicable for 64Cu Production
by: S. Soenarjo, et al.
Published: (2012-04-01)

Porting the EOS from X86 (Intel) to aarch64 (ARM) architecture
by: Cheng Yaosong, et al.
Published: (2021-01-01)

Fast speaker adaptation using extended diagonal linear transformation for deep neural networks
by: Donghyun Kim, et al.
Published: (2018-11-01)

Gestión en Salud I - TF64 201701
by: Universidad Peruana de Ciencias Aplicadas (UPC)
Published: (2018)

Gestión en Salud I - TF64 201702
by: Universidad Peruana de Ciencias Aplicadas (UPC)
Published: (2018)

Gestión en Salud I - TF64 201801
by: Universidad Peruana de Ciencias Aplicadas (UPC)
Published: (2018)

Porting the LHCb Stack from x86 (Intel) to aarch64 (ARM) and ppc64le (PowerPC)
by: Promberger Laura, et al.
Published: (2019-01-01)

MOSDA: On-Chip Memory Optimized Sparse Deep Neural Network Accelerator With Efficient Index Matching
by: Hongjie Xu, et al.
Published: (2021-01-01)

Comparison of nuclear data of 64Cu production using an accelerator by TALYS 1.0 code
by: Sadeghi Mahdi, et al.
Published: (2010-01-01)

De novo Genome Assembly of the indica Rice Variety IR64 Using Linked-Read Sequencing and Nanopore Sequencing
by: Tsuyoshi Tanaka, et al.
Published: (2020-05-01)

Apolonij Rodoški v katulovi 64. pesmi
by: Marko Marinčič
Published: (2000-07-01)

Copper-64: a real theranostic agent
by: Gutfilen B, et al.
Published: (2018-10-01)

Binary Output Layer of Feedforward Neural Networks for Solving Multi-Class Classification Problems
by: Sibo Yang, et al.
Published: (2019-01-01)

СOPPER-64 ISOTOPE PRODUCTION IN THE 64Ni(p,n)64Cu REACTION
by: TIBA ALI, et al.
Published: (2021-03-01)

The utility of 64-multidetector computed tomography in the diagnosis and staging of hepatoblastoma patients
by: Moustafa Abdel Kader, et al.
Published: (2016-12-01)

Biodistribution and radiation dosimetry of [64Cu]copper dichloride: first-in-human study in healthy volunteers
by: M.A. Avila-Rodriguez, et al.
Published: (2017-12-01)

Copper-64 radiopharmaceuticals for receptor-mediated tumor imaging and radiotherapy
by: Eiblmaier, Martin
Published: (2008)

Evaluation of the role of dynamic 64-MDCT in the characterization and work up of breast cancer
by: Moustafa A. Kader A. Wahab, et al.
Published: (2015-06-01)

Automated optimization for memory‐efficient high‐performance deep neural network accelerators
by: HyunMi Kim, et al.
Published: (2020-07-01)

G-protein coupled receptor 64 (GPR64) acts as a tumor suppressor in endometrial cancer
by: Jong Il Ahn, et al.
Published: (2019-08-01)

Flow Cytometric Determination of Neutrophil CD64 (nCD64) in Children with Community Acquired Pneumonia
by: Abdelhakeem Abdelmohsen, et al.
Published: (2019-08-01)

Mechanics of binary crushable granular assembly through discrete element method
by: Raghuram Karthik Desu, et al.
Published: (2016-12-01)

Are IEEE 754 32-Bit and 64-Bit Binary Floating-Point Accurate Enough?
by: Bernaridho Hutabarat, et al.
Published: (2011-09-01)

Protective effect of E-64d on calcium-induced cataract
by: Yi Yang, et al.
Published: (2015-06-01)

Racconti morali. 64. Berlinale – Internationale Filmfestspiele Berlin 2014
by: Leonardo Quaresima
Published: (2014-11-01)

Design Framework for ReRAM-Based DNN Accelerators with Accuracy and Hardware Evaluation
by: Cheng, W.-K, et al.
Published: (2022)

ESTADO, ACUMULAÇÃO E ORGANIZAÇÃO DO ESPAÇO BRASILEIRO PÓS-64
by: Valdeci Monteiro dos Santos
Published: (1992-06-01)

The role of nCD64 in the diagnosis of neonatal sepsis in preterm newborns
by: Jan Halek, et al.
Published: (2018-11-01)

Deep Learning for Diagnostic Binary Classification of Multiple-Lesion Skin Diseases
by: Kenneth Thomsen, et al.
Published: (2020-09-01)