The Study on Model-based Image Coding at Very Low Bit Rate
碩士 === 國立清華大學 === 資訊科學研究所 === 84 === In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori know...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
1996
|
Online Access: | http://ndltd.ncl.edu.tw/handle/32164435713094731177 |
id |
ndltd-TW-084NTHU3394015 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-084NTHU33940152016-07-13T04:10:35Z http://ndltd.ncl.edu.tw/handle/32164435713094731177 The Study on Model-based Image Coding at Very Low Bit Rate 以模塑為基礎超低位元率影像編碼之探討 Lee, Wei-Gwo 李維國 碩士 國立清華大學 資訊科學研究所 84 In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori knowledge. From the view of such image sequence coding, the subjects to be coded are mainly the 3-D head motions and the facial expressions. The encoder extracts the motion and expression parameters from input image sequences and transmits them to the decoder. The decoder synthesizes the output image sequences through the transmitted parameters. Clearly the transmitted information is much less than the original, that is, the compression ratio can be up to thousands of times. Therefore, it can be easily apply to the image sequence with large frame size, such as 512x512 or 640x480, etc. In our codec, a generic 3D head model with thousands of triangle polygons is used. For each image frame F of the given image sequence, we adjust the model by comparing feature points in the projected head model with that in F to construct a personalized pseudo-3D model for F. The block matching algorithm is applied to find the displacement vectors of feature points between two consecutive frames. These vectors will be coded and transitted to the decoder then. In decoding, we apply a simple 2D warping technique to mapping the polygonal textures (obtained in the first frame ) to reconstruct each image frame. In our experience, the total bitcount is less than lk per 640x480 frame if the residuals (for some acting areas) can be further coded to improve the quality. The residuals coding we used follows JPEG. Wang, J. S. 王家祥 1996 學位論文 ; thesis 38 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊科學研究所 === 84 ===
In this paper, we propose a prototype of very low bit rate model-based image codec. It is designated to encode head-andshoulder image sequences, in which whose statistical and structural characteristics can be modeled based on the large amount of a priori knowledge. From the view of such image sequence coding, the subjects to be coded are mainly the 3-D head motions and the facial expressions. The encoder extracts the motion and expression parameters from input image sequences and transmits them to the decoder. The decoder synthesizes the output image sequences through the transmitted parameters. Clearly the transmitted information is much less than the original, that is, the compression ratio can be up to thousands of times. Therefore, it can be easily apply to the image sequence with large frame size, such as 512x512 or 640x480, etc.
In our codec, a generic 3D head model with thousands of triangle polygons is used. For each image frame F of the given image sequence, we adjust the model by comparing feature points in the projected head model with that in F to construct a personalized pseudo-3D model for F. The block matching algorithm is applied to find the displacement vectors of feature points between two consecutive frames. These vectors will be coded and transitted to the decoder then. In decoding, we apply a simple 2D warping technique to mapping the polygonal textures (obtained in the first frame ) to reconstruct each image frame. In our experience, the total bitcount is less than lk per 640x480 frame if the residuals (for some acting areas) can be further coded to improve the quality. The residuals coding we used follows JPEG.
|
author2 |
Wang, J. S. |
author_facet |
Wang, J. S. Lee, Wei-Gwo 李維國 |
author |
Lee, Wei-Gwo 李維國 |
spellingShingle |
Lee, Wei-Gwo 李維國 The Study on Model-based Image Coding at Very Low Bit Rate |
author_sort |
Lee, Wei-Gwo |
title |
The Study on Model-based Image Coding at Very Low Bit Rate |
title_short |
The Study on Model-based Image Coding at Very Low Bit Rate |
title_full |
The Study on Model-based Image Coding at Very Low Bit Rate |
title_fullStr |
The Study on Model-based Image Coding at Very Low Bit Rate |
title_full_unstemmed |
The Study on Model-based Image Coding at Very Low Bit Rate |
title_sort |
study on model-based image coding at very low bit rate |
publishDate |
1996 |
url |
http://ndltd.ncl.edu.tw/handle/32164435713094731177 |
work_keys_str_mv |
AT leeweigwo thestudyonmodelbasedimagecodingatverylowbitrate AT lǐwéiguó thestudyonmodelbasedimagecodingatverylowbitrate AT leeweigwo yǐmósùwèijīchǔchāodīwèiyuánlǜyǐngxiàngbiānmǎzhītàntǎo AT lǐwéiguó yǐmósùwèijīchǔchāodīwèiyuánlǜyǐngxiàngbiānmǎzhītàntǎo AT leeweigwo studyonmodelbasedimagecodingatverylowbitrate AT lǐwéiguó studyonmodelbasedimagecodingatverylowbitrate |
_version_ |
1718345159616233472 |