The Study on Model-based Image Coding at Very Low Bit Rate

碩士 === 國立清華大學 === 資訊科學研究所 === 84 ===   In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori know...

Full description

Bibliographic Details
Main Authors: Lee, Wei-Gwo, 李維國
Other Authors: Wang, J. S.
Format: Others
Language:en_US
Published: 1996
Online Access:http://ndltd.ncl.edu.tw/handle/32164435713094731177
id ndltd-TW-084NTHU3394015
record_format oai_dc
spelling ndltd-TW-084NTHU33940152016-07-13T04:10:35Z http://ndltd.ncl.edu.tw/handle/32164435713094731177 The Study on Model-based Image Coding at Very Low Bit Rate 以模塑為基礎超低位元率影像編碼之探討 Lee, Wei-Gwo 李維國 碩士 國立清華大學 資訊科學研究所 84   In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori knowledge. From the view of such image sequence coding, the subjects to be coded are mainly the 3-D head motions and the facial expressions. The encoder extracts the motion and expression parameters from input image sequences and transmits them to the decoder. The decoder synthesizes the output image sequences through the transmitted parameters. Clearly the transmitted information is much less than the original, that is, the compression ratio can be up to thousands of times. Therefore, it can be easily apply to the image sequence with large frame size, such as 512x512 or 640x480, etc.   In our codec, a generic 3D head model with thousands of triangle polygons is used. For each image frame F of the given image sequence, we adjust the model by comparing feature points in the projected head model with that in F to construct a personalized pseudo-3D model for F. The block matching algorithm is applied to find the displacement vectors of feature points between two consecutive frames. These vectors will be coded and transitted to the decoder then. In decoding, we apply a simple 2D warping technique to mapping the polygonal textures (obtained in the first frame ) to reconstruct each image frame. In our experience, the total bitcount is less than lk per 640x480 frame if the residuals (for some acting areas) can be further coded to improve the quality. The residuals coding we used follows JPEG. Wang, J. S. 王家祥 1996 學位論文 ; thesis 38 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊科學研究所 === 84 ===   In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori knowledge. From the view of such image sequence coding, the subjects to be coded are mainly the 3-D head motions and the facial expressions. The encoder extracts the motion and expression parameters from input image sequences and transmits them to the decoder. The decoder synthesizes the output image sequences through the transmitted parameters. Clearly the transmitted information is much less than the original, that is, the compression ratio can be up to thousands of times. Therefore, it can be easily apply to the image sequence with large frame size, such as 512x512 or 640x480, etc.   In our codec, a generic 3D head model with thousands of triangle polygons is used. For each image frame F of the given image sequence, we adjust the model by comparing feature points in the projected head model with that in F to construct a personalized pseudo-3D model for F. The block matching algorithm is applied to find the displacement vectors of feature points between two consecutive frames. These vectors will be coded and transitted to the decoder then. In decoding, we apply a simple 2D warping technique to mapping the polygonal textures (obtained in the first frame ) to reconstruct each image frame. In our experience, the total bitcount is less than lk per 640x480 frame if the residuals (for some acting areas) can be further coded to improve the quality. The residuals coding we used follows JPEG.
author2 Wang, J. S.
author_facet Wang, J. S.
Lee, Wei-Gwo
李維國
author Lee, Wei-Gwo
李維國
spellingShingle Lee, Wei-Gwo
李維國
The Study on Model-based Image Coding at Very Low Bit Rate
author_sort Lee, Wei-Gwo
title The Study on Model-based Image Coding at Very Low Bit Rate
title_short The Study on Model-based Image Coding at Very Low Bit Rate
title_full The Study on Model-based Image Coding at Very Low Bit Rate
title_fullStr The Study on Model-based Image Coding at Very Low Bit Rate
title_full_unstemmed The Study on Model-based Image Coding at Very Low Bit Rate
title_sort study on model-based image coding at very low bit rate
publishDate 1996
url http://ndltd.ncl.edu.tw/handle/32164435713094731177
work_keys_str_mv AT leeweigwo thestudyonmodelbasedimagecodingatverylowbitrate
AT lǐwéiguó thestudyonmodelbasedimagecodingatverylowbitrate
AT leeweigwo yǐmósùwèijīchǔchāodīwèiyuánlǜyǐngxiàngbiānmǎzhītàntǎo
AT lǐwéiguó yǐmósùwèijīchǔchāodīwèiyuánlǜyǐngxiàngbiānmǎzhītàntǎo
AT leeweigwo studyonmodelbasedimagecodingatverylowbitrate
AT lǐwéiguó studyonmodelbasedimagecodingatverylowbitrate
_version_ 1718345159616233472