Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture

Following trends that emphasize neural networks for machine learning, many studies regarding computing systems have focused on accelerating deep neural networks. These studies often propose utilizing the accelerator specialized in a neural network and the cluster architecture composed of interconnec...

Full description

Bibliographic Details
Main Authors: Yuhwan Ro, Eojin Lee, Jung Ho Ahn
Format: Article
Language:English
Published: MDPI AG 2018-07-01
Series:Electronics
Subjects:
Online Access:http://www.mdpi.com/2079-9292/7/8/130
id doaj-d022d58b9fd94008abb33f59561d50ab
record_format Article
spelling doaj-d022d58b9fd94008abb33f59561d50ab2020-11-25T02:20:51ZengMDPI AGElectronics2079-92922018-07-017813010.3390/electronics7080130electronics7080130Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning ArchitectureYuhwan Ro0Eojin Lee1Jung Ho Ahn2Department of Transdisciplinary Studies, Seoul National University, Seoul 08826, KoreaDepartment of Transdisciplinary Studies, Seoul National University, Seoul 08826, KoreaDepartment of Transdisciplinary Studies, Seoul National University, Seoul 08826, KoreaFollowing trends that emphasize neural networks for machine learning, many studies regarding computing systems have focused on accelerating deep neural networks. These studies often propose utilizing the accelerator specialized in a neural network and the cluster architecture composed of interconnected accelerator chips. We observed that inter-accelerator communication within a cluster has a significant impact on the training time of the neural network. In this paper, we show the advantages of optical interconnects for multi-chip machine-learning architecture by demonstrating performance improvements through replacing electrical interconnects with optical ones in an existing multi-chip system. We propose to use highly practical optical interconnect implementation and devise an arithmetic performance model to fairly assess the impact of optical interconnects on a machine-learning accelerator platform. In our evaluation of nine Convolutional Neural Networks with various input sizes, 100 and 400 Gbps optical interconnects reduce the training time by an average of 20.6% and 35.6%, respectively, compared to the baseline system with 25.6 Gbps electrical ones.http://www.mdpi.com/2079-9292/7/8/130machine learningacceleratoroptical interconnectmulti-chip architectureclusterConvolutional Neural Network (CNN)
collection DOAJ
language English
format Article
sources DOAJ
author Yuhwan Ro
Eojin Lee
Jung Ho Ahn
spellingShingle Yuhwan Ro
Eojin Lee
Jung Ho Ahn
Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture
Electronics
machine learning
accelerator
optical interconnect
multi-chip architecture
cluster
Convolutional Neural Network (CNN)
author_facet Yuhwan Ro
Eojin Lee
Jung Ho Ahn
author_sort Yuhwan Ro
title Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture
title_short Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture
title_full Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture
title_fullStr Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture
title_full_unstemmed Evaluating the Impact of Optical Interconnects on a Multi-Chip Machine-Learning Architecture
title_sort evaluating the impact of optical interconnects on a multi-chip machine-learning architecture
publisher MDPI AG
series Electronics
issn 2079-9292
publishDate 2018-07-01
description Following trends that emphasize neural networks for machine learning, many studies regarding computing systems have focused on accelerating deep neural networks. These studies often propose utilizing the accelerator specialized in a neural network and the cluster architecture composed of interconnected accelerator chips. We observed that inter-accelerator communication within a cluster has a significant impact on the training time of the neural network. In this paper, we show the advantages of optical interconnects for multi-chip machine-learning architecture by demonstrating performance improvements through replacing electrical interconnects with optical ones in an existing multi-chip system. We propose to use highly practical optical interconnect implementation and devise an arithmetic performance model to fairly assess the impact of optical interconnects on a machine-learning accelerator platform. In our evaluation of nine Convolutional Neural Networks with various input sizes, 100 and 400 Gbps optical interconnects reduce the training time by an average of 20.6% and 35.6%, respectively, compared to the baseline system with 25.6 Gbps electrical ones.
topic machine learning
accelerator
optical interconnect
multi-chip architecture
cluster
Convolutional Neural Network (CNN)
url http://www.mdpi.com/2079-9292/7/8/130
work_keys_str_mv AT yuhwanro evaluatingtheimpactofopticalinterconnectsonamultichipmachinelearningarchitecture
AT eojinlee evaluatingtheimpactofopticalinterconnectsonamultichipmachinelearningarchitecture
AT junghoahn evaluatingtheimpactofopticalinterconnectsonamultichipmachinelearningarchitecture
_version_ 1724869407587434496