Architecture Design for Real-Time CNN-Based Super Resolution Using Multi-Layer Computation

碩士 === 國立臺灣大學 === 電子工程學研究所 === 106 === In recent years, the display resolution on end devices is enhancing rapidly. The super resolution aims to enhance visual quality when low-resolution contents play at high-resolution displays. Besides, high frame-rate is also an important issue to make the scene...

Full description

Bibliographic Details
Main Authors: Chung-Yan Chih, 池忠諺
Other Authors: 陳良基
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/e3h95p
id ndltd-TW-106NTU05428029
record_format oai_dc
spelling ndltd-TW-106NTU054280292019-05-16T00:22:53Z http://ndltd.ncl.edu.tw/handle/e3h95p Architecture Design for Real-Time CNN-Based Super Resolution Using Multi-Layer Computation 基於CNN方法與多層計算之即時超級解析度架構設計 Chung-Yan Chih 池忠諺 碩士 國立臺灣大學 電子工程學研究所 106 In recent years, the display resolution on end devices is enhancing rapidly. The super resolution aims to enhance visual quality when low-resolution contents play at high-resolution displays. Besides, high frame-rate is also an important issue to make the scene smooth. On the other hand, deep learning is growing rapidly in recent years, it can achieve better performance compared to the original methods in many computer vision domain, including super resolution. In super resolution domain, we find that the learning based methods are better than the non-learning-based methods as expected. But learning-based methods have massive computation, real-time performance is hard to achieve. The truth is, the architectures with non-learning based methods which can achieve real-time performance are rare. In upscaling option, most of the hardware can only perform upscale 2 and most of the architectures'' output resolution is below full-HD. We proposed a customized ASIC implementation of super resolution which can perform 60 fps upscaled full-HD or even higher output by different upscaling factors. Our architecture is learning based super resolution algorithm. We first compare the difference between high-resolution (HR) network and low-resolution (HR) network and decide to use LR network for computation cost concern. We then decide to use "Fast Super Resolution Convolutional Neural Network" (FSRCNN) to implement. Since the original FSRCNN has deconvolution layer but the major part is convolution, we decide to modify the deconvolution filter into multi-size convolution filters to make the hardware has higher utilization. We propose our architecture for hardware friendly FSRCNN and we define the bandwidth problem and use a different strategy to reduce the bandwidth and memory cost. Instead of layer-wise computation, which is commonly used in convolutional neural network hardware accelerator, we use multi-layer computation to reduce bandwidth requirement and enhance throughput. We arrange data flow and store reuse data by using level B data reuse to make our work a memory efficient architecture We implement an FSRCNN accelerator with a lower bandwidth, smaller on-chip memory, and high frame-rate. 陳良基 2017 學位論文 ; thesis 82 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 電子工程學研究所 === 106 === In recent years, the display resolution on end devices is enhancing rapidly. The super resolution aims to enhance visual quality when low-resolution contents play at high-resolution displays. Besides, high frame-rate is also an important issue to make the scene smooth. On the other hand, deep learning is growing rapidly in recent years, it can achieve better performance compared to the original methods in many computer vision domain, including super resolution. In super resolution domain, we find that the learning based methods are better than the non-learning-based methods as expected. But learning-based methods have massive computation, real-time performance is hard to achieve. The truth is, the architectures with non-learning based methods which can achieve real-time performance are rare. In upscaling option, most of the hardware can only perform upscale 2 and most of the architectures'' output resolution is below full-HD. We proposed a customized ASIC implementation of super resolution which can perform 60 fps upscaled full-HD or even higher output by different upscaling factors. Our architecture is learning based super resolution algorithm. We first compare the difference between high-resolution (HR) network and low-resolution (HR) network and decide to use LR network for computation cost concern. We then decide to use "Fast Super Resolution Convolutional Neural Network" (FSRCNN) to implement. Since the original FSRCNN has deconvolution layer but the major part is convolution, we decide to modify the deconvolution filter into multi-size convolution filters to make the hardware has higher utilization. We propose our architecture for hardware friendly FSRCNN and we define the bandwidth problem and use a different strategy to reduce the bandwidth and memory cost. Instead of layer-wise computation, which is commonly used in convolutional neural network hardware accelerator, we use multi-layer computation to reduce bandwidth requirement and enhance throughput. We arrange data flow and store reuse data by using level B data reuse to make our work a memory efficient architecture We implement an FSRCNN accelerator with a lower bandwidth, smaller on-chip memory, and high frame-rate.
author2 陳良基
author_facet 陳良基
Chung-Yan Chih
池忠諺
author Chung-Yan Chih
池忠諺
spellingShingle Chung-Yan Chih
池忠諺
Architecture Design for Real-Time CNN-Based Super Resolution Using Multi-Layer Computation
author_sort Chung-Yan Chih
title Architecture Design for Real-Time CNN-Based Super Resolution Using Multi-Layer Computation
title_short Architecture Design for Real-Time CNN-Based Super Resolution Using Multi-Layer Computation
title_full Architecture Design for Real-Time CNN-Based Super Resolution Using Multi-Layer Computation
title_fullStr Architecture Design for Real-Time CNN-Based Super Resolution Using Multi-Layer Computation
title_full_unstemmed Architecture Design for Real-Time CNN-Based Super Resolution Using Multi-Layer Computation
title_sort architecture design for real-time cnn-based super resolution using multi-layer computation
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/e3h95p
work_keys_str_mv AT chungyanchih architecturedesignforrealtimecnnbasedsuperresolutionusingmultilayercomputation
AT chízhōngyàn architecturedesignforrealtimecnnbasedsuperresolutionusingmultilayercomputation
AT chungyanchih jīyúcnnfāngfǎyǔduōcéngjìsuànzhījíshíchāojíjiěxīdùjiàgòushèjì
AT chízhōngyàn jīyúcnnfāngfǎyǔduōcéngjìsuànzhījíshíchāojíjiěxīdùjiàgòushèjì
_version_ 1719165215689408512