In Search of the Performance- And Energy-Efficient CNN Accelerators

In this paper, starting from the algorithm, a performance- and energy-efficient 3D structure or shape of the Tensor Processing Engine (TPE) for CNN acceleration is systematically searched and evaluated. An optimal accelerator’s shape maximizes the number of concurrent MAC operations per clock cycle...

Full description

Bibliographic Details
Main Authors:	Sedukhin, S. (Author), Tomioka, Y. (Author), Yamamoto, K. (Author)
Format:	Article
Language:	English
Published:	Institute of Electronics Information Communication Engineers 2022
Subjects:	computing efficiency data reusing multi-channel convolution tensor processing vector-parallel computing
Online Access:	View Fulltext in Publisher


LEADER	01831nam a2200217Ia 4500
001	10.1587-TRANSELE.2021LHP0003
008	220630s2022 CNT 000 0 und d
020			\|a 09168524 (ISSN)
245	1	0	\|a In Search of the Performance- And Energy-Efficient CNN Accelerators
260		0	\|b Institute of Electronics Information Communication Engineers \|c 2022
520	3		\|a In this paper, starting from the algorithm, a performance- and energy-efficient 3D structure or shape of the Tensor Processing Engine (TPE) for CNN acceleration is systematically searched and evaluated. An optimal accelerator’s shape maximizes the number of concurrent MAC operations per clock cycle while minimizes the number of redundant operations. The proposed 3D vector-parallel TPE architecture with an optimal shape can be very efficiently used for considerable CNN acceleration. Due to implemented support of inter-block image data independency, it is possible to use multiple of such TPEs for the additional CNN acceleration. Moreover, it is shown that the proposed TPE can also be uniformly used for acceleration of the different CNN models such as VGG, ResNet, YOLO, and SSD. We also demonstrate that our theoretical efficiency analysis is matched with the result of a real implementation for an SSD model to which a state-of-the-art channel pruning technique is applied. © 2022 The Institute of Electronics, Information and Communication Engineers
650	0	4	\|a computing efficiency
650	0	4	\|a data reusing
650	0	4	\|a multi-channel convolution
650	0	4	\|a tensor processing
650	0	4	\|a vector-parallel computing
700	1	0	\|a Sedukhin, S. \|e author
700	1	0	\|a Tomioka, Y. \|e author
700	1	0	\|a Yamamoto, K. \|e author
773			\|t IEICE Transactions on Electronics
856			\|z View Fulltext in Publisher \|u https://doi.org/10.1587/TRANSELE.2021LHP0003

In Search of the Performance- And Energy-Efficient CNN Accelerators

Similar Items