Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture
Following trends that emphasize neural networks for machine learning, many studies regarding computing systems have focused on accelerating deep neural networks. These studies often propose utilizing the accelerator specialized in a neural network and the cluster architecture composed of interconnec...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-07-01
|
Series: | Electronics |
Subjects: | |
Online Access: | http://www.mdpi.com/2079-9292/7/8/130 |
id |
doaj-d022d58b9fd94008abb33f59561d50ab |
---|---|
record_format |
Article |
spelling |
doaj-d022d58b9fd94008abb33f59561d50ab2020-11-25T02:20:51ZengMDPI AGElectronics2079-92922018-07-017813010.3390/electronics7080130electronics7080130Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning ArchitectureYuhwan Ro0Eojin Lee1Jung Ho Ahn2Department of Transdisciplinary Studies, Seoul National University, Seoul 08826, KoreaDepartment of Transdisciplinary Studies, Seoul National University, Seoul 08826, KoreaDepartment of Transdisciplinary Studies, Seoul National University, Seoul 08826, KoreaFollowing trends that emphasize neural networks for machine learning, many studies regarding computing systems have focused on accelerating deep neural networks. These studies often propose utilizing the accelerator specialized in a neural network and the cluster architecture composed of interconnected accelerator chips. We observed that inter-accelerator communication within a cluster has a significant impact on the training time of the neural network. In this paper, we show the advantages of optical interconnects for multi-chip machine-learning architecture by demonstrating performance improvements through replacing electrical interconnects with optical ones in an existing multi-chip system. We propose to use highly practical optical interconnect implementation and devise an arithmetic performance model to fairly assess the impact of optical interconnects on a machine-learning accelerator platform. In our evaluation of nine Convolutional Neural Networks with various input sizes, 100 and 400 Gbps optical interconnects reduce the training time by an average of 20.6% and 35.6%, respectively, compared to the baseline system with 25.6 Gbps electrical ones.http://www.mdpi.com/2079-9292/7/8/130machine learningacceleratoroptical interconnectmulti-chip architectureclusterConvolutional Neural Network (CNN) |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yuhwan Ro Eojin Lee Jung Ho Ahn |
spellingShingle |
Yuhwan Ro Eojin Lee Jung Ho Ahn Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture Electronics machine learning accelerator optical interconnect multi-chip architecture cluster Convolutional Neural Network (CNN) |
author_facet |
Yuhwan Ro Eojin Lee Jung Ho Ahn |
author_sort |
Yuhwan Ro |
title |
Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture |
title_short |
Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture |
title_full |
Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture |
title_fullStr |
Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture |
title_full_unstemmed |
Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture |
title_sort |
evaluating the impact of optical interconnects on a multi-chip machine-learning architecture |
publisher |
MDPI AG |
series |
Electronics |
issn |
2079-9292 |
publishDate |
2018-07-01 |
description |
Following trends that emphasize neural networks for machine learning, many studies regarding computing systems have focused on accelerating deep neural networks. These studies often propose utilizing the accelerator specialized in a neural network and the cluster architecture composed of interconnected accelerator chips. We observed that inter-accelerator communication within a cluster has a significant impact on the training time of the neural network. In this paper, we show the advantages of optical interconnects for multi-chip machine-learning architecture by demonstrating performance improvements through replacing electrical interconnects with optical ones in an existing multi-chip system. We propose to use highly practical optical interconnect implementation and devise an arithmetic performance model to fairly assess the impact of optical interconnects on a machine-learning accelerator platform. In our evaluation of nine Convolutional Neural Networks with various input sizes, 100 and 400 Gbps optical interconnects reduce the training time by an average of 20.6% and 35.6%, respectively, compared to the baseline system with 25.6 Gbps electrical ones. |
topic |
machine learning accelerator optical interconnect multi-chip architecture cluster Convolutional Neural Network (CNN) |
url |
http://www.mdpi.com/2079-9292/7/8/130 |
work_keys_str_mv |
AT yuhwanro evaluatingtheimpactofopticalinterconnectsonamultichipmachinelearningarchitecture AT eojinlee evaluatingtheimpactofopticalinterconnectsonamultichipmachinelearningarchitecture AT junghoahn evaluatingtheimpactofopticalinterconnectsonamultichipmachinelearningarchitecture |
_version_ |
1724869407587434496 |