The Study on Model-based Image Coding at Very Low Bit Rate

碩士 === 國立清華大學 === 資訊科學研究所 === 84 === 　　In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori know...

Full description

Bibliographic Details
Main Authors:	Lee, Wei-Gwo, 李維國
Other Authors:	Wang, J. S.
Format:	Others
Language:	en_US
Published:	1996
Online Access:	http://ndltd.ncl.edu.tw/handle/32164435713094731177

id	ndltd-TW-084NTHU3394015
record_format	oai_dc
spelling	ndltd-TW-084NTHU33940152016-07-13T04:10:35Z http://ndltd.ncl.edu.tw/handle/32164435713094731177 The Study on Model-based Image Coding at Very Low Bit Rate 以模塑為基礎超低位元率影像編碼之探討 Lee, Wei-Gwo 李維國碩士國立清華大學資訊科學研究所 84 　　In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori knowledge. From the view of such image sequence coding, the subjects to be coded are mainly the 3-D head motions and the facial expressions. The encoder extracts the motion and expression parameters from input image sequences and transmits them to the decoder. The decoder synthesizes the output image sequences through the transmitted parameters. Clearly the transmitted information is much less than the original, that is, the compression ratio can be up to thousands of times. Therefore, it can be easily apply to the image sequence with large frame size, such as 512x512 or 640x480, etc. 　　In our codec, a generic 3D head model with thousands of triangle polygons is used. For each image frame F of the given image sequence, we adjust the model by comparing feature points in the projected head model with that in F to construct a personalized pseudo-3D model for F. The block matching algorithm is applied to find the displacement vectors of feature points between two consecutive frames. These vectors will be coded and transitted to the decoder then. In decoding, we apply a simple 2D warping technique to mapping the polygonal textures (obtained in the first frame ) to reconstruct each image frame. In our experience, the total bitcount is less than lk per 640x480 frame if the residuals (for some acting areas) can be further coded to improve the quality. The residuals coding we used follows JPEG. Wang, J. S. 王家祥 1996 學位論文 ; thesis 38 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立清華大學 === 資訊科學研究所 === 84 === 　　In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori knowledge. From the view of such image sequence coding, the subjects to be coded are mainly the 3-D head motions and the facial expressions. The encoder extracts the motion and expression parameters from input image sequences and transmits them to the decoder. The decoder synthesizes the output image sequences through the transmitted parameters. Clearly the transmitted information is much less than the original, that is, the compression ratio can be up to thousands of times. Therefore, it can be easily apply to the image sequence with large frame size, such as 512x512 or 640x480, etc. 　　In our codec, a generic 3D head model with thousands of triangle polygons is used. For each image frame F of the given image sequence, we adjust the model by comparing feature points in the projected head model with that in F to construct a personalized pseudo-3D model for F. The block matching algorithm is applied to find the displacement vectors of feature points between two consecutive frames. These vectors will be coded and transitted to the decoder then. In decoding, we apply a simple 2D warping technique to mapping the polygonal textures (obtained in the first frame ) to reconstruct each image frame. In our experience, the total bitcount is less than lk per 640x480 frame if the residuals (for some acting areas) can be further coded to improve the quality. The residuals coding we used follows JPEG.
author2	Wang, J. S.
author_facet	Wang, J. S. Lee, Wei-Gwo 李維國
author	Lee, Wei-Gwo 李維國
spellingShingle	Lee, Wei-Gwo 李維國 The Study on Model-based Image Coding at Very Low Bit Rate
author_sort	Lee, Wei-Gwo
title	The Study on Model-based Image Coding at Very Low Bit Rate
title_short	The Study on Model-based Image Coding at Very Low Bit Rate
title_full	The Study on Model-based Image Coding at Very Low Bit Rate
title_fullStr	The Study on Model-based Image Coding at Very Low Bit Rate
title_full_unstemmed	The Study on Model-based Image Coding at Very Low Bit Rate
title_sort	study on model-based image coding at very low bit rate
publishDate	1996
url	http://ndltd.ncl.edu.tw/handle/32164435713094731177
work_keys_str_mv	AT leeweigwo thestudyonmodelbasedimagecodingatverylowbitrate AT lǐwéiguó thestudyonmodelbasedimagecodingatverylowbitrate AT leeweigwo yǐmósùwèijīchǔchāodīwèiyuánlǜyǐngxiàngbiānmǎzhītàntǎo AT lǐwéiguó yǐmósùwèijīchǔchāodīwèiyuánlǜyǐngxiàngbiānmǎzhītàntǎo AT leeweigwo studyonmodelbasedimagecodingatverylowbitrate AT lǐwéiguó studyonmodelbasedimagecodingatverylowbitrate
_version_	1718345159616233472

The Study on Model-based Image Coding at Very Low Bit Rate

Similar Items