A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder
In the High Efficiency Video Coding (HEVC), a variety of CU sizes and intra prediction modes significantly improve coding efficiency, but also bring higher computational complexity. This paper proposes a new compact VLSI architecture for HEVC intra prediction, which is geared towards 8K video decodi...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8865037/ |
id |
doaj-41adf6ba0f2f4fdfabe8aac074848cce |
---|---|
record_format |
Article |
spelling |
doaj-41adf6ba0f2f4fdfabe8aac074848cce2021-03-29T23:56:21ZengIEEEIEEE Access2169-35362019-01-01714909714910410.1109/ACCESS.2019.29469078865037A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC DecoderYibo Fan0Genwei Tang1https://orcid.org/0000-0002-3467-7496Xiaoyang Zeng2State Key Laboratory of ASIC and System, Fudan University, Shanghai, ChinaState Key Laboratory of ASIC and System, Fudan University, Shanghai, ChinaState Key Laboratory of ASIC and System, Fudan University, Shanghai, ChinaIn the High Efficiency Video Coding (HEVC), a variety of CU sizes and intra prediction modes significantly improve coding efficiency, but also bring higher computational complexity. This paper proposes a new compact VLSI architecture for HEVC intra prediction, which is geared towards 8K video decoding. It supports all the transform unit (TU) sizes and 35 HEVC intra prediction modes. First, this paper introduces a TU-oriented intra predictor with a throughput of 32 pixels, which can be newly arranged with the TU size. It can be a line of 32 pixels, two lines of 16 pixels, 4 lines of 8 pixels or four lines of four pixels. This TU-oriented architecture allows intra-prediction and inverse discrete cosine transform (IDCT) to be computed in parallel, removing the memory between them. In addition, a horizontal and vertical line buffer for reference sample is proposed, which only cost 0.8K bit and is implemented by register files with SRAM-free. Finally, to further reduce hardware consumption, multipliers can be shared in the prediction. The implementation result shows that the compact architecture supports 8K video application and costs 66.2K logic gates, which is synthesized with the TSMC 65nm process under 400MHz.https://ieeexplore.ieee.org/document/8865037/32-pixelTU-orientedSRAM-freeHEVC intra prediction8K decoder |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yibo Fan Genwei Tang Xiaoyang Zeng |
spellingShingle |
Yibo Fan Genwei Tang Xiaoyang Zeng A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder IEEE Access 32-pixel TU-oriented SRAM-free HEVC intra prediction 8K decoder |
author_facet |
Yibo Fan Genwei Tang Xiaoyang Zeng |
author_sort |
Yibo Fan |
title |
A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder |
title_short |
A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder |
title_full |
A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder |
title_fullStr |
A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder |
title_full_unstemmed |
A Compact 32-Pixel TU-Oriented and SRAM-Free Intra Prediction VLSI Architecture for HEVC Decoder |
title_sort |
compact 32-pixel tu-oriented and sram-free intra prediction vlsi architecture for hevc decoder |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
In the High Efficiency Video Coding (HEVC), a variety of CU sizes and intra prediction modes significantly improve coding efficiency, but also bring higher computational complexity. This paper proposes a new compact VLSI architecture for HEVC intra prediction, which is geared towards 8K video decoding. It supports all the transform unit (TU) sizes and 35 HEVC intra prediction modes. First, this paper introduces a TU-oriented intra predictor with a throughput of 32 pixels, which can be newly arranged with the TU size. It can be a line of 32 pixels, two lines of 16 pixels, 4 lines of 8 pixels or four lines of four pixels. This TU-oriented architecture allows intra-prediction and inverse discrete cosine transform (IDCT) to be computed in parallel, removing the memory between them. In addition, a horizontal and vertical line buffer for reference sample is proposed, which only cost 0.8K bit and is implemented by register files with SRAM-free. Finally, to further reduce hardware consumption, multipliers can be shared in the prediction. The implementation result shows that the compact architecture supports 8K video application and costs 66.2K logic gates, which is synthesized with the TSMC 65nm process under 400MHz. |
topic |
32-pixel TU-oriented SRAM-free HEVC intra prediction 8K decoder |
url |
https://ieeexplore.ieee.org/document/8865037/ |
work_keys_str_mv |
AT yibofan acompact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder AT genweitang acompact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder AT xiaoyangzeng acompact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder AT yibofan compact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder AT genweitang compact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder AT xiaoyangzeng compact32pixeltuorientedandsramfreeintrapredictionvlsiarchitectureforhevcdecoder |
_version_ |
1724188863523454976 |