Time-multiplexed FPGA overlay networks on chip
How do we design a communication network for processing elements (PEs) on a single chip that minimizes application communication time and area? In designing such a network it is essential to use a network communication pattern that matches application communication and area requirements. This report...
Main Author: | |
---|---|
Format: | Others |
Published: |
2006
|
Online Access: | https://thesis.library.caltech.edu/2335/1/final.pdf Mehta, Nikil (2006) Time-multiplexed FPGA overlay networks on chip. Master's thesis, California Institute of Technology. doi:10.7907/WZTS-XR26. https://resolver.caltech.edu/CaltechETD:etd-05312006-164103 <https://resolver.caltech.edu/CaltechETD:etd-05312006-164103> |
id |
ndltd-CALTECH-oai-thesis.library.caltech.edu-2335 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-CALTECH-oai-thesis.library.caltech.edu-23352019-12-22T03:07:15Z Time-multiplexed FPGA overlay networks on chip Mehta, Nikil How do we design a communication network for processing elements (PEs) on a single chip that minimizes application communication time and area? In designing such a network it is essential to use a network communication pattern that matches application communication and area requirements. This report characterizes the design space of a particular communication pattern for networks on chip: Time-Multiplexed Interconnect. In contrast to more commonly used packet-switched networks, which route communication dynamically, time-multiplexed networks schedule all possible communication prior to runtime with an offline router. We describe how to build well engineered, highly scalable time-multiplexed FPGA networks in terms of topology selection, routing algorithm design and hardware design that operate on a Xilinx XC2V6000-4 at 166MHz. To benchmark our networks we use real, communication rich applications instead of generating synthetic traffic. We show that over all areas (10K–10M slices) and over all applications the best "one topology fits all" is Butterfly Fat Tree (BFT) with c = 1, p = 0.5, which requires, in the worst case, 6.1x as many cycles to route communication than the optimal topology. We compare time-multiplexing to packet-switching, and show that on average, over all applications for all equivalent topologies, online packet-switched communication requires 1.5x as many cycles to route as offline time-multiplexed scheduling. When applying designs to equivalent area, for areas <100K slices packet-switching typically outperforms time-multiplexing, but at >100K slices packet-switching requires up to 3.4x as many cycles to route as time-multiplexing in the worst case. Finally, for equivalent, large networks (>100 PEs) time-multiplexing outperforms packet-switching for communication loads where greater than 10% of all logical links are active. This demonstrates that well designed time-multiplexed FPGA overlay networks can deliver performance and area efficiency exceeding that of packet-switched networks. 2006 Thesis NonPeerReviewed application/pdf https://thesis.library.caltech.edu/2335/1/final.pdf https://resolver.caltech.edu/CaltechETD:etd-05312006-164103 Mehta, Nikil (2006) Time-multiplexed FPGA overlay networks on chip. Master's thesis, California Institute of Technology. doi:10.7907/WZTS-XR26. https://resolver.caltech.edu/CaltechETD:etd-05312006-164103 <https://resolver.caltech.edu/CaltechETD:etd-05312006-164103> https://thesis.library.caltech.edu/2335/ |
collection |
NDLTD |
format |
Others
|
sources |
NDLTD |
description |
How do we design a communication network for processing elements (PEs) on a single chip that minimizes application communication time and area? In designing such a network it is essential to use a network communication pattern that matches application communication and area requirements. This report characterizes the design space of a particular communication pattern for networks on chip: Time-Multiplexed Interconnect. In contrast to more commonly used packet-switched networks, which route communication dynamically, time-multiplexed networks schedule all possible communication prior to runtime with an offline router. We describe how to build well engineered, highly scalable time-multiplexed FPGA networks in terms of topology selection, routing algorithm design and hardware design that operate on a Xilinx XC2V6000-4 at 166MHz. To benchmark our networks we use real, communication rich applications instead of generating synthetic traffic. We show that over all areas (10K–10M slices) and over all applications the best "one topology fits all" is Butterfly Fat Tree (BFT) with c = 1, p = 0.5, which requires, in the worst case, 6.1x as many cycles to route communication than the optimal topology. We compare time-multiplexing to packet-switching, and show that on average, over all applications for all equivalent topologies, online packet-switched communication requires 1.5x as many cycles to route as offline time-multiplexed scheduling. When applying designs to equivalent area, for areas <100K slices packet-switching typically outperforms time-multiplexing, but at >100K slices packet-switching requires up to 3.4x as many cycles to route as time-multiplexing in the worst case. Finally, for equivalent, large networks (>100 PEs) time-multiplexing outperforms packet-switching for communication loads where greater than 10% of all logical links are active. This demonstrates that well designed time-multiplexed FPGA overlay networks can deliver performance and area efficiency exceeding that of packet-switched networks. |
author |
Mehta, Nikil |
spellingShingle |
Mehta, Nikil Time-multiplexed FPGA overlay networks on chip |
author_facet |
Mehta, Nikil |
author_sort |
Mehta, Nikil |
title |
Time-multiplexed FPGA overlay networks on chip |
title_short |
Time-multiplexed FPGA overlay networks on chip |
title_full |
Time-multiplexed FPGA overlay networks on chip |
title_fullStr |
Time-multiplexed FPGA overlay networks on chip |
title_full_unstemmed |
Time-multiplexed FPGA overlay networks on chip |
title_sort |
time-multiplexed fpga overlay networks on chip |
publishDate |
2006 |
url |
https://thesis.library.caltech.edu/2335/1/final.pdf Mehta, Nikil (2006) Time-multiplexed FPGA overlay networks on chip. Master's thesis, California Institute of Technology. doi:10.7907/WZTS-XR26. https://resolver.caltech.edu/CaltechETD:etd-05312006-164103 <https://resolver.caltech.edu/CaltechETD:etd-05312006-164103> |
work_keys_str_mv |
AT mehtanikil timemultiplexedfpgaoverlaynetworksonchip |
_version_ |
1719304701321674752 |