A Highly Parallel and Scalable Motion Estimation Algorithm with GPU for HEVC

We propose a highly parallel and scalable motion estimation algorithm, named multilevel resolution motion estimation (MLRME for short), by combining the advantages of local full search and downsampling. By subsampling a video frame, a large amount of computation is saved. While using the local full-...

Full description

Bibliographic Details
Main Authors: Yun-gang Xue, Hua-you Su, Ju Ren, Mei Wen, Chun-yuan Zhang, Li-quan Xiao
Format: Article
Language:English
Published: Hindawi Limited 2017-01-01
Series:Scientific Programming
Online Access:http://dx.doi.org/10.1155/2017/1431574
Description
Summary:We propose a highly parallel and scalable motion estimation algorithm, named multilevel resolution motion estimation (MLRME for short), by combining the advantages of local full search and downsampling. By subsampling a video frame, a large amount of computation is saved. While using the local full-search method, it can exploit massive parallelism and make full use of the powerful modern many-core accelerators, such as GPU and Intel Xeon Phi. We implanted the proposed MLRME into HM12.0, and the experimental results showed that the encoding quality of the MLRME method is close to that of the fast motion estimation in HEVC, which declines by less than 1.5%. We also implemented the MLRME with CUDA, which obtained 30–60x speed-up compared to the serial algorithm on single CPU. Specifically, the parallel implementation of MLRME on a GTX 460 GPU can meet the real-time coding requirement with about 25 fps for the 2560×1600 video format, while, for 832×480, the performance is more than 100 fps.
ISSN:1058-9244
1875-919X