Cooperative high-performance computing with FPGAs - matrix multiply case-study

In high-performance computing, there is great opportunity for systems that use FPGAs to handle communication while also performing computation on data in transit in an ``altruistic'' manner--that is, using resources for computation that might otherwise be used for communication, and in...

Full description

Bibliographic Details
Main Author: Munafo, Robert
Other Authors: Herbordt, Martin C.
Language:en_US
Published: 2018
Subjects:
Online Access:https://hdl.handle.net/2144/30740
id ndltd-bu.edu-oai-open.bu.edu-2144-30740
record_format oai_dc
spelling ndltd-bu.edu-oai-open.bu.edu-2144-307402019-01-08T15:44:27Z Cooperative high-performance computing with FPGAs - matrix multiply case-study Munafo, Robert Herbordt, Martin C. Computer engineering In high-performance computing, there is great opportunity for systems that use FPGAs to handle communication while also performing computation on data in transit in an ``altruistic'' manner--that is, using resources for computation that might otherwise be used for communication, and in a way that improves overall system performance and efficiency. We provide a specific definition of \textbf{Computing in the Network} that captures this opportunity. We then outline some overall requirements and guidelines for cooperative computing that include this ability, and make suggestions for specific computing capabilities to be added to the networking hardware in a system. We then explore some algorithms running on a network so equipped for a few specific computing tasks: dense matrix multiplication, sparse matrix transposition and sparse matrix multiplication. In the first instance we give limits of problem size and estimates of performance that should be attainable with present-day FPGA hardware. 2018-08-10T18:03:49Z 2018-08-10T18:03:49Z 2018 2018-07-03T01:05:17Z Thesis/Dissertation https://hdl.handle.net/2144/30740 en_US Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/
collection NDLTD
language en_US
sources NDLTD
topic Computer engineering
spellingShingle Computer engineering
Munafo, Robert
Cooperative high-performance computing with FPGAs - matrix multiply case-study
description In high-performance computing, there is great opportunity for systems that use FPGAs to handle communication while also performing computation on data in transit in an ``altruistic'' manner--that is, using resources for computation that might otherwise be used for communication, and in a way that improves overall system performance and efficiency. We provide a specific definition of \textbf{Computing in the Network} that captures this opportunity. We then outline some overall requirements and guidelines for cooperative computing that include this ability, and make suggestions for specific computing capabilities to be added to the networking hardware in a system. We then explore some algorithms running on a network so equipped for a few specific computing tasks: dense matrix multiplication, sparse matrix transposition and sparse matrix multiplication. In the first instance we give limits of problem size and estimates of performance that should be attainable with present-day FPGA hardware.
author2 Herbordt, Martin C.
author_facet Herbordt, Martin C.
Munafo, Robert
author Munafo, Robert
author_sort Munafo, Robert
title Cooperative high-performance computing with FPGAs - matrix multiply case-study
title_short Cooperative high-performance computing with FPGAs - matrix multiply case-study
title_full Cooperative high-performance computing with FPGAs - matrix multiply case-study
title_fullStr Cooperative high-performance computing with FPGAs - matrix multiply case-study
title_full_unstemmed Cooperative high-performance computing with FPGAs - matrix multiply case-study
title_sort cooperative high-performance computing with fpgas - matrix multiply case-study
publishDate 2018
url https://hdl.handle.net/2144/30740
work_keys_str_mv AT munaforobert cooperativehighperformancecomputingwithfpgasmatrixmultiplycasestudy
_version_ 1718812976309338112