A Study of Mode Analysis and Fast Mode Decision for Multi-view Video Coding

碩士 === 國立暨南國際大學 === 電機工程學系 === 97 === Currently, 3D video has become the major topic in television system that provides the viewer with reality experience. However, the high transmission bandwidth, the huge data storage, and the complex computational time of 3D video make it difficult to be realized...

Full description

Bibliographic Details
Main Authors: Ho-Ju Lin, 林鶴儒
Other Authors: Pei-Jun Lee
Format: Others
Language:zh-TW
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/71965854523452879332
Description
Summary:碩士 === 國立暨南國際大學 === 電機工程學系 === 97 === Currently, 3D video has become the major topic in television system that provides the viewer with reality experience. However, the high transmission bandwidth, the huge data storage, and the complex computational time of 3D video make it difficult to be realized for home TV user. Thus, an efficient compression algorithm for 3D video is a major task. Joint Video Team (JVT) developed a multi-view video coding (MVC) that based on H.264 codec which uses the RDO selection for the best coding mode from inter modes and intra modes. Although, the selection of the best coding mode by RDO improves the compression performance but the computational complexity is increased. The exhaustive search for all inter modes and intra modes for inter-frame coding cause the encoder to take a large computational time while less than 4% blocks are chosen as intra-mode in real sequences. Therefore, this thesis proposed a fast mode decision algorithm to improve the coding efficiency of MVC, and achieves low computational complexity while maintain good quality of the reconstructed frame. This thesis used the structure of hierarchical B pictures for the proposed algorithm. The coding mode selection is divided into two main operations: basic view (view 0) and multi-view (view 1-7). For the basic view, the mode selection is categorized into two types: 1) B1, 2) B2B3. In B1, the value of RD-cost for Skip mode is employed to determine the best coding mode. In B2B3, this thesis uses the neighboring macroblock’s coding mode correlation to determine the most suitable coding mode. For the multi-view (view1-7) coding selection, the similar area between reference and current frame is analyzed. If MB belongs to the similar region between the reference and current frame, the MB in the similar regions will be chosen as inter-mode, else the temporal direction will be employed to determine the coding mode for the current macroblock. Experimental results show that the encoding time for the basic view and multi-view is saved up to 78%, and 80%, respectively, and the quality of multi-view video is almost remained.