Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control
碩士 === 國立交通大學 === 多媒體工程研究所 === 107 === Reinforcement learning has proven effective for solving decision making problems. However, its application to modern video codecs has yet to be seen. This thesis presents an early attempt to introduce reinforcement learning to HEVC/H.265 intra-frame rate contro...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2018
|
Online Access: | http://ndltd.ncl.edu.tw/handle/t942e8 |
id |
ndltd-TW-107NCTU5641016 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NCTU56410162019-05-16T01:40:47Z http://ndltd.ncl.edu.tw/handle/t942e8 Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control 運用增強式學習於HEVC/H.265幀內位元率控制 Hu, Jun-Hao 胡俊顥 碩士 國立交通大學 多媒體工程研究所 107 Reinforcement learning has proven effective for solving decision making problems. However, its application to modern video codecs has yet to be seen. This thesis presents an early attempt to introduce reinforcement learning to HEVC/H.265 intra-frame rate control. The task is to determine the λ value in the rate-distortion optimization for every coding tree unit in a frame, with the objective being to minimize the frame-level distortion subject to a rate constraint. We draw an analogy between the rate control problem and the reinforcement learning problem, by considering the texture complexity of coding tree units and bit balance as the environment state, the λ value as an action that an agent needs to take, and the negative distortion of the coding tree unit as an immediate reward. We train a neural network based on DDPF algorithm to be our agent, which observes the state to evaluate the reward for each possible action. The experimental results show that the proposed model can already perform more accurate and state bit rate control than the rate control algorithm in HM-16.15. Peng, Wen-Hsiao 彭文孝 2018 學位論文 ; thesis 41 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 多媒體工程研究所 === 107 === Reinforcement learning has proven effective for solving decision making problems. However, its application to modern video codecs has yet to be seen. This thesis presents an early attempt to introduce reinforcement learning to HEVC/H.265 intra-frame rate control. The task is to determine the λ value in the rate-distortion optimization for every coding tree unit in a frame, with the objective being to minimize the frame-level distortion subject to a rate constraint. We draw an analogy between the rate control problem and the reinforcement learning problem, by considering the texture complexity of coding tree units and bit balance as the environment state, the λ value as an action that an agent needs to take, and the negative distortion of the coding tree unit as an immediate reward. We train a neural network based on DDPF algorithm to be our agent, which observes the state to evaluate the reward for each possible action. The experimental results show that the proposed model can already perform more accurate and state bit rate control than the rate control algorithm in HM-16.15.
|
author2 |
Peng, Wen-Hsiao |
author_facet |
Peng, Wen-Hsiao Hu, Jun-Hao 胡俊顥 |
author |
Hu, Jun-Hao 胡俊顥 |
spellingShingle |
Hu, Jun-Hao 胡俊顥 Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control |
author_sort |
Hu, Jun-Hao |
title |
Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control |
title_short |
Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control |
title_full |
Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control |
title_fullStr |
Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control |
title_full_unstemmed |
Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control |
title_sort |
reinforcement learning for hevc/h.265 intra-frame rate control |
publishDate |
2018 |
url |
http://ndltd.ncl.edu.tw/handle/t942e8 |
work_keys_str_mv |
AT hujunhao reinforcementlearningforhevch265intraframeratecontrol AT hújùnhào reinforcementlearningforhevch265intraframeratecontrol AT hujunhao yùnyòngzēngqiángshìxuéxíyúhevch265zhèngnèiwèiyuánlǜkòngzhì AT hújùnhào yùnyòngzēngqiángshìxuéxíyúhevch265zhèngnèiwèiyuánlǜkòngzhì |
_version_ |
1719178795748950016 |