Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation

碩士 === 國立交通大學 === 多媒體工程研究所 === 107 === Frame-level bit allocation is crucial to video rate control. The problem is often cast as minimizing the distortions of a group of video frames subjective to a rate constraint. When these video frames are related through inter-frame prediction, the bit allocati...

Full description

Bibliographic Details
Main Authors:	Chen, Lian-Ching, 陳蓮清
Other Authors:	Peng, Wen-Hsiao
Format:	Others
Language:	en_US
Published:	2018
Online Access:	http://ndltd.ncl.edu.tw/handle/p75xk6

id	ndltd-TW-107NCTU5641018
record_format	oai_dc
spelling	ndltd-TW-107NCTU56410182019-05-16T01:40:47Z http://ndltd.ncl.edu.tw/handle/p75xk6 Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation 運用增強式學習於HEVC/H.265之幀層位元分配 Chen, Lian-Ching 陳蓮清碩士國立交通大學多媒體工程研究所 107 Frame-level bit allocation is crucial to video rate control. The problem is often cast as minimizing the distortions of a group of video frames subjective to a rate constraint. When these video frames are related through inter-frame prediction, the bit allocation for different frames exhibits dependency. To address such dependency, this thesis introduces reinforcement learning. We first consider frame-level texture complexity and bit balance as a state signal, define the bit allocation for each frame as an action, and compute the negative frame-level distortion as an immediate reward signal. We then train a neural network to be our agent, which observes the state to allocate bits to each frame in order to maximize cumulative reward. As compared to the rate control scheme in x265-2.7, our method has smaller bit rate fluctuations. However, the coding performance of our model still has room for improvement. Peng, Wen-Hsiao Lin, Cheng-Chung 彭文孝林正中 2018 學位論文 ; thesis 33 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立交通大學 === 多媒體工程研究所 === 107 === Frame-level bit allocation is crucial to video rate control. The problem is often cast as minimizing the distortions of a group of video frames subjective to a rate constraint. When these video frames are related through inter-frame prediction, the bit allocation for different frames exhibits dependency. To address such dependency, this thesis introduces reinforcement learning. We first consider frame-level texture complexity and bit balance as a state signal, define the bit allocation for each frame as an action, and compute the negative frame-level distortion as an immediate reward signal. We then train a neural network to be our agent, which observes the state to allocate bits to each frame in order to maximize cumulative reward. As compared to the rate control scheme in x265-2.7, our method has smaller bit rate fluctuations. However, the coding performance of our model still has room for improvement.
author2	Peng, Wen-Hsiao
author_facet	Peng, Wen-Hsiao Chen, Lian-Ching 陳蓮清
author	Chen, Lian-Ching 陳蓮清
spellingShingle	Chen, Lian-Ching 陳蓮清 Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
author_sort	Chen, Lian-Ching
title	Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_short	Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_full	Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_fullStr	Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_full_unstemmed	Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation
title_sort	reinforcement learning for hevc/h.265 frame-level bit allocation
publishDate	2018
url	http://ndltd.ncl.edu.tw/handle/p75xk6
work_keys_str_mv	AT chenlianching reinforcementlearningforhevch265framelevelbitallocation AT chénliánqīng reinforcementlearningforhevch265framelevelbitallocation AT chenlianching yùnyòngzēngqiángshìxuéxíyúhevch265zhīzhèngcéngwèiyuánfēnpèi AT chénliánqīng yùnyòngzēngqiángshìxuéxíyúhevch265zhīzhèngcéngwèiyuánfēnpèi
_version_	1719178796772360192

Reinforcement Learning for HEVC/H.265 Frame-level Bit Allocation

Similar Items