A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder

In the High Efficiency Video Coding (HEVC), a variety of CU sizes and intra prediction modes significantly improve coding efficiency, but also bring higher computational complexity. This paper proposes a new compact VLSI architecture for HEVC intra prediction, which is geared towards 8K video decodi...

Full description

Bibliographic Details
Main Authors: Yibo Fan, Genwei Tang, Xiaoyang Zeng
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8865037/
id doaj-41adf6ba0f2f4fdfabe8aac074848cce
record_format Article
spelling doaj-41adf6ba0f2f4fdfabe8aac074848cce2021-03-29T23:56:21ZengIEEEIEEE Access2169-35362019-01-01714909714910410.1109/ACCESS.2019.29469078865037A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC DecoderYibo Fan0Genwei Tang1https://orcid.org/0000-0002-3467-7496Xiaoyang Zeng2State Key Laboratory of ASIC and System, Fudan University, Shanghai, ChinaState Key Laboratory of ASIC and System, Fudan University, Shanghai, ChinaState Key Laboratory of ASIC and System, Fudan University, Shanghai, ChinaIn the High Efficiency Video Coding (HEVC), a variety of CU sizes and intra prediction modes significantly improve coding efficiency, but also bring higher computational complexity. This paper proposes a new compact VLSI architecture for HEVC intra prediction, which is geared towards 8K video decoding. It supports all the transform unit (TU) sizes and 35 HEVC intra prediction modes. First, this paper introduces a TU-oriented intra predictor with a throughput of 32 pixels, which can be newly arranged with the TU size. It can be a line of 32 pixels, two lines of 16 pixels, 4 lines of 8 pixels or four lines of four pixels. This TU-oriented architecture allows intra-prediction and inverse discrete cosine transform (IDCT) to be computed in parallel, removing the memory between them. In addition, a horizontal and vertical line buffer for reference sample is proposed, which only cost 0.8K bit and is implemented by register files with SRAM-free. Finally, to further reduce hardware consumption, multipliers can be shared in the prediction. The implementation result shows that the compact architecture supports 8K video application and costs 66.2K logic gates, which is synthesized with the TSMC 65nm process under 400MHz.https://ieeexplore.ieee.org/document/8865037/32-pixelTU-orientedSRAM-freeHEVC intra prediction8K decoder
collection DOAJ
language English
format Article
sources DOAJ
author Yibo Fan
Genwei Tang
Xiaoyang Zeng
spellingShingle Yibo Fan
Genwei Tang
Xiaoyang Zeng
A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder
IEEE Access
32-pixel
TU-oriented
SRAM-free
HEVC intra prediction
8K decoder
author_facet Yibo Fan
Genwei Tang
Xiaoyang Zeng
author_sort Yibo Fan
title A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder
title_short A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder
title_full A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder
title_fullStr A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder
title_full_unstemmed A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder
title_sort compact 32-pixel tu-oriented and sram-free intra prediction vlsi architecture for hevc decoder
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description In the High Efficiency Video Coding (HEVC), a variety of CU sizes and intra prediction modes significantly improve coding efficiency, but also bring higher computational complexity. This paper proposes a new compact VLSI architecture for HEVC intra prediction, which is geared towards 8K video decoding. It supports all the transform unit (TU) sizes and 35 HEVC intra prediction modes. First, this paper introduces a TU-oriented intra predictor with a throughput of 32 pixels, which can be newly arranged with the TU size. It can be a line of 32 pixels, two lines of 16 pixels, 4 lines of 8 pixels or four lines of four pixels. This TU-oriented architecture allows intra-prediction and inverse discrete cosine transform (IDCT) to be computed in parallel, removing the memory between them. In addition, a horizontal and vertical line buffer for reference sample is proposed, which only cost 0.8K bit and is implemented by register files with SRAM-free. Finally, to further reduce hardware consumption, multipliers can be shared in the prediction. The implementation result shows that the compact architecture supports 8K video application and costs 66.2K logic gates, which is synthesized with the TSMC 65nm process under 400MHz.
topic 32-pixel
TU-oriented
SRAM-free
HEVC intra prediction
8K decoder
url https://ieeexplore.ieee.org/document/8865037/
work_keys_str_mv AT yibofan acompact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder
AT genweitang acompact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder
AT xiaoyangzeng acompact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder
AT yibofan compact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder
AT genweitang compact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder
AT xiaoyangzeng compact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder
_version_ 1724188863523454976