Similar Image and Video Retrieval Systems Based on Spatial and Color Attributes
博士 === 國立中正大學 === 資訊工程研究所 === 89 === The spatial attribute is the most natural way to describe an image for human beings. However, generating the symbolic representation by automatically extracting objects from an original image is very difficult. The color attribute is a very important...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2000
|
Online Access: | http://ndltd.ncl.edu.tw/handle/68710183283439231139 |
id |
ndltd-TW-089CCU00392084 |
---|---|
record_format |
oai_dc |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立中正大學 === 資訊工程研究所 === 89 === The spatial attribute is the most natural way to describe an image for human beings. However, generating the symbolic representation by automatically extracting objects from an original image is very difficult. The color attribute is a very important cue in extracting information from images; based on the color attribute, the database system can be implemented automatically without human intervention. The goal of this dissertation is to investigate the spatial and color information of images and to evolve an effective and efficient similar image retrieval system from both attributes above. Additionally, this dissertation still extends the techniques of the image retrieval to those of video retrieval.
To accord with the requirements of different applications, this dissertation presents a spatial similar image retrieval method based on the concept of tolerable difference of direction. In this method, users can determine the tolerable difference of direction to satisfy their requirements by themselves. This dissertation also extends it to recognize rotational invariance images, in which it is needless to specify where the rotation center is.
The relation of spatial and temporal plays an important role in indexing video information. This dissertation defines a 9DLT string to outline the spatial relationships among the objects of a symbolic image, and proposes a dynamic programming method to indicate the matching sequence of query frames and database frames. It also defines a common component binary tree (CCBT) to store a set of 9DLT strings. This structure cuts down not only a great portion of storage space, but also the computing time of image matching distance between query video frames and database video frames.
Traditional methods obtain the minimal video frame sequence matching (VFSM) distance between two video frame sequences Q and V by calculating the image matching distances of all the possible frame pairs (qi, vj). Here, qi is the i-th frame of Q, and vj is the j-th frame of V. Obviously, it is a time-consuming strategy. This dissertation provides a piecewise method to overcome this drawback, which needs to compute the image matching distances of the frame pairs (qi, vj) only if qi is similar to vj.
To speed up video retrieval, this dissertation is engaged in developing a fast filter to prune off a large number of unqualified videos. This filter uses an image signature to indicate the spatial information of the objects in a video frame. It also introduces a cutting method to split a video into segments, where the image signatures in a segment are similar. The proposed filter, requiring only little memory space, can quickly cut away a large number of unqualified videos.
Color histogram is the most commonly used color feature in image retrieval techniques. However, this feature cannot effectively characterize an image since it captures only the global properties. To make the retrieval more accurate, this dissertation introduces the run-length feature. The feature integrates the color distribution and shape information of the objects in an image. It can effectively discriminate the directions, sizes, and geometrical shapes of objects. Yet, extracting the run-length feature is time-consuming. For this reason, this dissertation also provides a revision of the run-length, called semi-run-length.
This dissertation also introduces three simple and effective image features ─ the color moment (CM), color variance of adjacent pixels (CVAP), and CM-CVAP. The CM feature delineates the color-spatial information of images, and the CVAP feature describes the color variance of pixels in an image. However, these two features can only characterize the content of images in different ways. This dissertation hence provides another feature CM-CVAP, which combines both features above, to raise the quality of similarity measure.
This dissertation proposes another three image features as well ─ the regional color moment feature (RCM), the wavelet-based feature, and the RCM-wavelet feature, which are useful for retrieving color images from an image database. The RCM feature is a revised version of the CM feature. The wavelet-based feature can describe not only the local and global color variances of pixels, but also some geographic shapes of objects in an image. The RCM-wavelet feature integrates the RCM and wavelet-based features to raise the quality of similarity measure.
To get the image retrieving more efficient, this dissertation still presents a fast simple filter in terms of the color and spatial distribution of images. Requiring little storage space, the proposed filter can prune away many unqualified images quickly.
|
author2 |
Chin-Chen Chang |
author_facet |
Chin-Chen Chang Yung-Kuan Chan 詹永寬 |
author |
Yung-Kuan Chan 詹永寬 |
spellingShingle |
Yung-Kuan Chan 詹永寬 Similar Image and Video Retrieval Systems Based on Spatial and Color Attributes |
author_sort |
Yung-Kuan Chan |
title |
Similar Image and Video Retrieval Systems Based on Spatial and Color Attributes |
title_short |
Similar Image and Video Retrieval Systems Based on Spatial and Color Attributes |
title_full |
Similar Image and Video Retrieval Systems Based on Spatial and Color Attributes |
title_fullStr |
Similar Image and Video Retrieval Systems Based on Spatial and Color Attributes |
title_full_unstemmed |
Similar Image and Video Retrieval Systems Based on Spatial and Color Attributes |
title_sort |
similar image and video retrieval systems based on spatial and color attributes |
publishDate |
2000 |
url |
http://ndltd.ncl.edu.tw/handle/68710183283439231139 |
work_keys_str_mv |
AT yungkuanchan similarimageandvideoretrievalsystemsbasedonspatialandcolorattributes AT zhānyǒngkuān similarimageandvideoretrievalsystemsbasedonspatialandcolorattributes AT yungkuanchan yǐkōngjiānyǔyánsèwèijīchǔdexiāngshìyǐngxiàngyǔshìxùnyǐngxiàngcháxúnxìtǒng AT zhānyǒngkuān yǐkōngjiānyǔyánsèwèijīchǔdexiāngshìyǐngxiàngyǔshìxùnyǐngxiàngcháxúnxìtǒng |
_version_ |
1718336462083063808 |
spelling |
ndltd-TW-089CCU003920842016-07-06T04:09:53Z http://ndltd.ncl.edu.tw/handle/68710183283439231139 Similar Image and Video Retrieval Systems Based on Spatial and Color Attributes 以空間與顏色為基礎的相似影像與視訊影像查詢系統 Yung-Kuan Chan 詹永寬 博士 國立中正大學 資訊工程研究所 89 The spatial attribute is the most natural way to describe an image for human beings. However, generating the symbolic representation by automatically extracting objects from an original image is very difficult. The color attribute is a very important cue in extracting information from images; based on the color attribute, the database system can be implemented automatically without human intervention. The goal of this dissertation is to investigate the spatial and color information of images and to evolve an effective and efficient similar image retrieval system from both attributes above. Additionally, this dissertation still extends the techniques of the image retrieval to those of video retrieval. To accord with the requirements of different applications, this dissertation presents a spatial similar image retrieval method based on the concept of tolerable difference of direction. In this method, users can determine the tolerable difference of direction to satisfy their requirements by themselves. This dissertation also extends it to recognize rotational invariance images, in which it is needless to specify where the rotation center is. The relation of spatial and temporal plays an important role in indexing video information. This dissertation defines a 9DLT string to outline the spatial relationships among the objects of a symbolic image, and proposes a dynamic programming method to indicate the matching sequence of query frames and database frames. It also defines a common component binary tree (CCBT) to store a set of 9DLT strings. This structure cuts down not only a great portion of storage space, but also the computing time of image matching distance between query video frames and database video frames. Traditional methods obtain the minimal video frame sequence matching (VFSM) distance between two video frame sequences Q and V by calculating the image matching distances of all the possible frame pairs (qi, vj). Here, qi is the i-th frame of Q, and vj is the j-th frame of V. Obviously, it is a time-consuming strategy. This dissertation provides a piecewise method to overcome this drawback, which needs to compute the image matching distances of the frame pairs (qi, vj) only if qi is similar to vj. To speed up video retrieval, this dissertation is engaged in developing a fast filter to prune off a large number of unqualified videos. This filter uses an image signature to indicate the spatial information of the objects in a video frame. It also introduces a cutting method to split a video into segments, where the image signatures in a segment are similar. The proposed filter, requiring only little memory space, can quickly cut away a large number of unqualified videos. Color histogram is the most commonly used color feature in image retrieval techniques. However, this feature cannot effectively characterize an image since it captures only the global properties. To make the retrieval more accurate, this dissertation introduces the run-length feature. The feature integrates the color distribution and shape information of the objects in an image. It can effectively discriminate the directions, sizes, and geometrical shapes of objects. Yet, extracting the run-length feature is time-consuming. For this reason, this dissertation also provides a revision of the run-length, called semi-run-length. This dissertation also introduces three simple and effective image features ─ the color moment (CM), color variance of adjacent pixels (CVAP), and CM-CVAP. The CM feature delineates the color-spatial information of images, and the CVAP feature describes the color variance of pixels in an image. However, these two features can only characterize the content of images in different ways. This dissertation hence provides another feature CM-CVAP, which combines both features above, to raise the quality of similarity measure. This dissertation proposes another three image features as well ─ the regional color moment feature (RCM), the wavelet-based feature, and the RCM-wavelet feature, which are useful for retrieving color images from an image database. The RCM feature is a revised version of the CM feature. The wavelet-based feature can describe not only the local and global color variances of pixels, but also some geographic shapes of objects in an image. The RCM-wavelet feature integrates the RCM and wavelet-based features to raise the quality of similarity measure. To get the image retrieving more efficient, this dissertation still presents a fast simple filter in terms of the color and spatial distribution of images. Requiring little storage space, the proposed filter can prune away many unqualified images quickly. Chin-Chen Chang 張真誠 2000 學位論文 ; thesis 150 en_US |