Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation

碩士 === 國立交通大學 === 多媒體工程研究所 === 107 === Frame-level bit allocation is crucial to video rate control. The problem is often cast as minimizing the distortions of a group of video frames subjective to a rate constraint. When these video frames are related through inter-frame prediction, the bit allocati...

Full description

Bibliographic Details
Main Authors: Chen, Lian-Ching, 陳蓮清
Other Authors: Peng, Wen-Hsiao
Format: Others
Language:en_US
Published: 2018
Online Access:http://ndltd.ncl.edu.tw/handle/p75xk6
id ndltd-TW-107NCTU5641018
record_format oai_dc
spelling ndltd-TW-107NCTU56410182019-05-16T01:40:47Z http://ndltd.ncl.edu.tw/handle/p75xk6 Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation 運用增強式學習於HEVC/H.265之幀層位元分配 Chen, Lian-Ching 陳蓮清 碩士 國立交通大學 多媒體工程研究所 107 Frame-level bit allocation is crucial to video rate control. The problem is often cast as minimizing the distortions of a group of video frames subjective to a rate constraint. When these video frames are related through inter-frame prediction, the bit allocation for different frames exhibits dependency. To address such dependency, this thesis introduces reinforcement learning. We first consider frame-level texture complexity and bit balance as a state signal, define the bit allocation for each frame as an action, and compute the negative frame-level distortion as an immediate reward signal. We then train a neural network to be our agent, which observes the state to allocate bits to each frame in order to maximize cumulative reward. As compared to the rate control scheme in x265-2.7, our method has smaller bit rate fluctuations. However, the coding performance of our model still has room for improvement. Peng, Wen-Hsiao Lin, Cheng-Chung 彭文孝 林正中 2018 學位論文 ; thesis 33 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 多媒體工程研究所 === 107 === Frame-level bit allocation is crucial to video rate control. The problem is often cast as minimizing the distortions of a group of video frames subjective to a rate constraint. When these video frames are related through inter-frame prediction, the bit allocation for different frames exhibits dependency. To address such dependency, this thesis introduces reinforcement learning. We first consider frame-level texture complexity and bit balance as a state signal, define the bit allocation for each frame as an action, and compute the negative frame-level distortion as an immediate reward signal. We then train a neural network to be our agent, which observes the state to allocate bits to each frame in order to maximize cumulative reward. As compared to the rate control scheme in x265-2.7, our method has smaller bit rate fluctuations. However, the coding performance of our model still has room for improvement.
author2 Peng, Wen-Hsiao
author_facet Peng, Wen-Hsiao
Chen, Lian-Ching
陳蓮清
author Chen, Lian-Ching
陳蓮清
spellingShingle Chen, Lian-Ching
陳蓮清
Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
author_sort Chen, Lian-Ching
title Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_short Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_full Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_fullStr Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_full_unstemmed Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_sort reinforcement learning for hevc/h.265 frame-level bit allocation
publishDate 2018
url http://ndltd.ncl.edu.tw/handle/p75xk6
work_keys_str_mv AT chenlianching reinforcementlearningforhevch265framelevelbitallocation
AT chénliánqīng reinforcementlearningforhevch265framelevelbitallocation
AT chenlianching yùnyòngzēngqiángshìxuéxíyúhevch265zhīzhèngcéngwèiyuánfēnpèi
AT chénliánqīng yùnyòngzēngqiángshìxuéxíyúhevch265zhīzhèngcéngwèiyuánfēnpèi
_version_ 1719178796772360192