Dynamic Bandwidth and Laser Scaling for CPU-GPU Heterogenous Network-on-Chip Architectures
Main Author: | |
---|---|
Language: | English |
Published: |
Ohio University / OhioLINK
2017
|
Subjects: | |
Online Access: | http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1500992706350957 |
id |
ndltd-OhioLink-oai-etd.ohiolink.edu-ohiou1500992706350957 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-OhioLink-oai-etd.ohiolink.edu-ohiou15009927063509572021-08-03T07:03:35Z Dynamic Bandwidth and Laser Scaling for CPU-GPU Heterogenous Network-on-Chip Architectures Van Winkle, Scott E. Computer Engineering Electrical Engineering CPU GPU Heterogeneous Network on Chip Photonic Machine Learning Dynamic bandwidth scaling Dynamic power scaling As the relentless quest for higher throughput and lower energy cost continues in heterogenous multicores, there is a strong demand for energy-efficient and high-performance Network-on-Chip (NoC) architectures. Heterogenous architectures that can simultaneously utilize both the serialized nature of the CPU as well as the thread level parallelism of the GPU are gaining traction in the industry. A critical issue with heterogenous architectures is finding an optimal way to utilize the shared resources such as the last level cache (LLC) and NoC without hindering the performance of either the CPU or the GPU core. Photonic interconnects are a disruptive technology solution that have the potential to increase the bandwidth, reduce latency, and improve energy-efficiency over traditional metallic interconnects. In this thesis, we propose a CPU-GPU heterogenous architecture called SHARP (Shared Heterogenous Architecture with Reconfigurable Photonic Network-on-Chip) that combines CPU and GPU cores around the same router. SHARP architecture is designed as a Single-Writer Multiple-Reader (SWMR) crossbar with reservation-assist to connect CPU/GPU cores. The architecture consists of 32 CPU cores and 64 GPU computational units. As network traffic exhibits temporal and spatial fluctuations due to application behavior, SHARP can dynamically reallocate bandwidth and thereby adapt to application demands. In this thesis, we propose to dynamically reallocate bandwidth and reduce power consumption by evaluating buffer utilization. While buffer utilization is a reactive technique that deals with fluctuations in application demands, we also propose a proactive technique wherein we use machine learning (ML) to optimize the bandwidth and power consumption. In ML, instead of predicting the buffer utilization, we predict the number of packets that will be generated by the heterogenous cluster. Simulation results where evaluated using PARSEC 2.1 and SPLASH2 benchmark suits for the CPU and OpenCL SDK benchmark suite for the GPU. Our simulation results demonstrate a 34% performance (throughput) improvement over a baseline electrical CMESH while consuming 25% less energy per bit when dynamically reallocating bandwidth. Further simulation results have also shown 6.9% to 14.9% performance improvement over other flavors of the proposed SHARP architecture without dynamic bandwidth allocation. When dynamically scaling laser power, the laser power scaling without ML demonstrates an 8.2% throughput loss with a 60.5% laser power improvement of a high laser power baseline when the reservation window is set to 25 cycles. With a reservation window size of 2000 cycles, the laser power scaling with ML has a negligible throughput loss when compared to the high laser power baseline with a 42% laser power savings. With a reservation window size of 500 cycles, the laser power scaling with ML demonstrates a 14.6% throughput loss with a 65.5% power savings. 2017-09-20 English text Ohio University / OhioLINK http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1500992706350957 http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1500992706350957 unrestricted This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws. |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
topic |
Computer Engineering Electrical Engineering CPU GPU Heterogeneous Network on Chip Photonic Machine Learning Dynamic bandwidth scaling Dynamic power scaling |
spellingShingle |
Computer Engineering Electrical Engineering CPU GPU Heterogeneous Network on Chip Photonic Machine Learning Dynamic bandwidth scaling Dynamic power scaling Van Winkle, Scott E. Dynamic Bandwidth and Laser Scaling for CPU-GPU Heterogenous Network-on-Chip Architectures |
author |
Van Winkle, Scott E. |
author_facet |
Van Winkle, Scott E. |
author_sort |
Van Winkle, Scott E. |
title |
Dynamic Bandwidth and Laser Scaling for CPU-GPU Heterogenous Network-on-Chip Architectures |
title_short |
Dynamic Bandwidth and Laser Scaling for CPU-GPU Heterogenous Network-on-Chip Architectures |
title_full |
Dynamic Bandwidth and Laser Scaling for CPU-GPU Heterogenous Network-on-Chip Architectures |
title_fullStr |
Dynamic Bandwidth and Laser Scaling for CPU-GPU Heterogenous Network-on-Chip Architectures |
title_full_unstemmed |
Dynamic Bandwidth and Laser Scaling for CPU-GPU Heterogenous Network-on-Chip Architectures |
title_sort |
dynamic bandwidth and laser scaling for cpu-gpu heterogenous network-on-chip architectures |
publisher |
Ohio University / OhioLINK |
publishDate |
2017 |
url |
http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1500992706350957 |
work_keys_str_mv |
AT vanwinklescotte dynamicbandwidthandlaserscalingforcpugpuheterogenousnetworkonchiparchitectures |
_version_ |
1719452746327785472 |