A VLSI design concept for parallel iterative algorithms

Modern VLSI manufacturing technology has kept shrinking down to the nanoscale level with a very fast trend. Integration with the advanced nano-technology now makes it possible to realize advanced parallel iterative algorithms directly which was almost impossible 10 years ago. In this paper, we want...

Full description

Bibliographic Details
Main Authors: C. C. Sun, J. Götze
Format: Article
Language:deu
Published: Copernicus Publications 2009-05-01
Series:Advances in Radio Science
Online Access:http://www.adv-radio-sci.net/7/95/2009/ars-7-95-2009.pdf
id doaj-d64c2d41d8034b33bc012fefaba32352
record_format Article
spelling doaj-d64c2d41d8034b33bc012fefaba323522020-11-24T23:08:53ZdeuCopernicus PublicationsAdvances in Radio Science 1684-99651684-99732009-05-01795100A VLSI design concept for parallel iterative algorithmsC. C. SunJ. GötzeModern VLSI manufacturing technology has kept shrinking down to the nanoscale level with a very fast trend. Integration with the advanced nano-technology now makes it possible to realize advanced parallel iterative algorithms directly which was almost impossible 10 years ago. In this paper, we want to discuss the influences of evolving VLSI technologies for iterative algorithms and present design strategies from an algorithmic and architectural point of view. Implementing an iterative algorithm on a multiprocessor array, there is a trade-off between the performance/complexity of processors and the load/throughput of interconnects. This is due to the behavior of iterative algorithms. For example, we could simplify the parallel implementation of the iterative algorithm (i.e., processor elements of the multiprocessor array) in any way as long as the convergence is guaranteed. However, the modification of the algorithm (processors) usually increases the number of required iterations which also means that the switch activity of interconnects is increasing. As an example we show that a 25×25 full Jacobi EVD array could be realized into one single FPGA device with the simplified μ-rotation CORDIC architecture. http://www.adv-radio-sci.net/7/95/2009/ars-7-95-2009.pdf
collection DOAJ
language deu
format Article
sources DOAJ
author C. C. Sun
J. Götze
spellingShingle C. C. Sun
J. Götze
A VLSI design concept for parallel iterative algorithms
Advances in Radio Science
author_facet C. C. Sun
J. Götze
author_sort C. C. Sun
title A VLSI design concept for parallel iterative algorithms
title_short A VLSI design concept for parallel iterative algorithms
title_full A VLSI design concept for parallel iterative algorithms
title_fullStr A VLSI design concept for parallel iterative algorithms
title_full_unstemmed A VLSI design concept for parallel iterative algorithms
title_sort vlsi design concept for parallel iterative algorithms
publisher Copernicus Publications
series Advances in Radio Science
issn 1684-9965
1684-9973
publishDate 2009-05-01
description Modern VLSI manufacturing technology has kept shrinking down to the nanoscale level with a very fast trend. Integration with the advanced nano-technology now makes it possible to realize advanced parallel iterative algorithms directly which was almost impossible 10 years ago. In this paper, we want to discuss the influences of evolving VLSI technologies for iterative algorithms and present design strategies from an algorithmic and architectural point of view. Implementing an iterative algorithm on a multiprocessor array, there is a trade-off between the performance/complexity of processors and the load/throughput of interconnects. This is due to the behavior of iterative algorithms. For example, we could simplify the parallel implementation of the iterative algorithm (i.e., processor elements of the multiprocessor array) in any way as long as the convergence is guaranteed. However, the modification of the algorithm (processors) usually increases the number of required iterations which also means that the switch activity of interconnects is increasing. As an example we show that a 25×25 full Jacobi EVD array could be realized into one single FPGA device with the simplified μ-rotation CORDIC architecture.
url http://www.adv-radio-sci.net/7/95/2009/ars-7-95-2009.pdf
work_keys_str_mv AT ccsun avlsidesignconceptforparalleliterativealgorithms
AT jgotze avlsidesignconceptforparalleliterativealgorithms
AT ccsun vlsidesignconceptforparalleliterativealgorithms
AT jgotze vlsidesignconceptforparalleliterativealgorithms
_version_ 1725612582961676288