A Heuristic Scheduler for Port-Constrained Floating-Point Pipelines
We describe a heuristic scheduling approach for optimizing floating-point pipelines subject to input port constraints. The objective of our technique is to maximize functional unit reuse while minimizing the following performance metrics in the generated circuit: (1) maximum multiplexer fanin, (2)...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2013-01-01
|
Series: | International Journal of Reconfigurable Computing |
Online Access: | http://dx.doi.org/10.1155/2013/849545 |
id |
doaj-f0a6f532ba4e4d6ca854941781b15766 |
---|---|
record_format |
Article |
spelling |
doaj-f0a6f532ba4e4d6ca854941781b157662020-11-24T21:23:18ZengHindawi LimitedInternational Journal of Reconfigurable Computing1687-71951687-72092013-01-01201310.1155/2013/849545849545A Heuristic Scheduler for Port-Constrained Floating-Point PipelinesZheming Jin0Jason D. Bakos1Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USADepartment of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USAWe describe a heuristic scheduling approach for optimizing floating-point pipelines subject to input port constraints. The objective of our technique is to maximize functional unit reuse while minimizing the following performance metrics in the generated circuit: (1) maximum multiplexer fanin, (2) datapath fanout, (3) number of multiplexers, and (4) number of registers. For a set of systems biology markup language (SBML) benchmark expressions, we compare the resource usages given by our method to those given by a branch-and-bound enumeration of all valid schedules. Compared with the enumeration results, our heuristic requires on average 33.4% less multiplexer bits and 32.9% less register bits than the worse case, while only requiring 14% more multiplexer bits and 4.5% more register bits than the optimal case. We also compare our results against those given by the state-of-art high-level synthesis tool Xilinx AutoESL. For the most complex of our benchmark expressions, our synthesis technique requires 20% less FPGA slices than AutoESL.http://dx.doi.org/10.1155/2013/849545 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zheming Jin Jason D. Bakos |
spellingShingle |
Zheming Jin Jason D. Bakos A Heuristic Scheduler for Port-Constrained Floating-Point Pipelines International Journal of Reconfigurable Computing |
author_facet |
Zheming Jin Jason D. Bakos |
author_sort |
Zheming Jin |
title |
A Heuristic Scheduler for Port-Constrained Floating-Point Pipelines |
title_short |
A Heuristic Scheduler for Port-Constrained Floating-Point Pipelines |
title_full |
A Heuristic Scheduler for Port-Constrained Floating-Point Pipelines |
title_fullStr |
A Heuristic Scheduler for Port-Constrained Floating-Point Pipelines |
title_full_unstemmed |
A Heuristic Scheduler for Port-Constrained Floating-Point Pipelines |
title_sort |
heuristic scheduler for port-constrained floating-point pipelines |
publisher |
Hindawi Limited |
series |
International Journal of Reconfigurable Computing |
issn |
1687-7195 1687-7209 |
publishDate |
2013-01-01 |
description |
We describe a heuristic scheduling approach for optimizing floating-point pipelines subject to input port constraints. The objective of our technique is to maximize functional unit reuse while minimizing the following performance metrics in the generated circuit: (1) maximum multiplexer fanin, (2) datapath fanout, (3) number of multiplexers, and (4) number of registers. For a set of systems biology markup language (SBML) benchmark expressions, we compare the resource usages given by our method to those given by a branch-and-bound enumeration of all valid schedules. Compared with the enumeration results, our heuristic requires on average 33.4% less multiplexer bits and 32.9% less register bits than the worse case, while only requiring 14% more multiplexer bits and 4.5% more register bits than the optimal case. We also compare our results against those given by the state-of-art high-level synthesis tool Xilinx AutoESL. For the most complex of our benchmark expressions, our synthesis technique requires 20% less FPGA slices than AutoESL. |
url |
http://dx.doi.org/10.1155/2013/849545 |
work_keys_str_mv |
AT zhemingjin aheuristicschedulerforportconstrainedfloatingpointpipelines AT jasondbakos aheuristicschedulerforportconstrainedfloatingpointpipelines AT zhemingjin heuristicschedulerforportconstrainedfloatingpointpipelines AT jasondbakos heuristicschedulerforportconstrainedfloatingpointpipelines |
_version_ |
1725992295684112384 |