Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains

Advancements in video object segmentation technology and the availability of efficient object-based video representations, such as MPEG-4 [1], have resulted in the increased availability of arbitrarily shaped digital video content. While this enables many exciting applications, the process of loc...

Full description

Bibliographic Details
Main Author:	Erol, Berna
Format:	Others
Language:	English
Published:	2009
Online Access:	http://hdl.handle.net/2429/12773

id	ndltd-UBC-oai-circle.library.ubc.ca-2429-12773
record_format	oai_dc
spelling	ndltd-UBC-oai-circle.library.ubc.ca-2429-127732018-01-05T17:36:26Z Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains Erol, Berna Advancements in video object segmentation technology and the availability of efficient object-based video representations, such as MPEG-4 [1], have resulted in the increased availability of arbitrarily shaped digital video content. While this enables many exciting applications, the process of locating and accessing a desired video sequence can still be challenging because of the large volume of data associated with even compressed video. This dissertation proposes generic methods for the retrieval of arbitrarily shaped video objects in the MPEG-4 compressed domain, using their shape, local motion, and color content. Considering that a one-minute long video sequence may contain more than 1,500 frames, summarization of video content is necessary as a first step to efficiently retrieve video. Therefore, we first suggest a method for the summarization of arbitrarily shaped video objects. This is achieved by selecting the temporal instants of video objects -based on their compressed domain shape information- that efficiently represent the objects' salient content. Next, we propose to extend some well-proven still shape retrieval techniques to retrieve video objects in the compressed domain. We compute the Fourier and ART (Angular Radial Transform) descriptors on the shape approximations obtained from the MPEG-4 shape coding modes. We also present a method to compute the shape distances between two video objects based on these still shape features. Unlike in the case of still objects, one of the key features that describe a video object is motion. Classification of video objects by their local motion is addressed in this thesis by presenting three new motion descriptors. These descriptors are computed based on the shape deformations of arbitrarily shaped video, and assume no prior knowledge about the video content. Color is one of the most widely used low level features in content-based retrieval. In this thesis, we also study efficient color content matching of arbitrarily shaped video, and in particular, color histogram computation in the MPEG-4 compressed domain. Our experimental results demonstrate that our techniques enable effective and low complexity content-based retrieval. Employing MPEG-4 compressed domain information not only obviates the need for full decompression of the bit stream, hence yielding substantial computational savings, but also allows our techniques to be more robust to segmentation errors. Applied Science, Faculty of Electrical and Computer Engineering, Department of Graduate 2009-09-15T16:28:21Z 2009-09-15T16:28:21Z 2002 2002-05 Text Thesis/Dissertation http://hdl.handle.net/2429/12773 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use. 10082499 bytes application/pdf
collection	NDLTD
language	English
format	Others
sources	NDLTD
description	Advancements in video object segmentation technology and the availability of efficient object-based video representations, such as MPEG-4 [1], have resulted in the increased availability of arbitrarily shaped digital video content. While this enables many exciting applications, the process of locating and accessing a desired video sequence can still be challenging because of the large volume of data associated with even compressed video. This dissertation proposes generic methods for the retrieval of arbitrarily shaped video objects in the MPEG-4 compressed domain, using their shape, local motion, and color content. Considering that a one-minute long video sequence may contain more than 1,500 frames, summarization of video content is necessary as a first step to efficiently retrieve video. Therefore, we first suggest a method for the summarization of arbitrarily shaped video objects. This is achieved by selecting the temporal instants of video objects -based on their compressed domain shape information- that efficiently represent the objects' salient content. Next, we propose to extend some well-proven still shape retrieval techniques to retrieve video objects in the compressed domain. We compute the Fourier and ART (Angular Radial Transform) descriptors on the shape approximations obtained from the MPEG-4 shape coding modes. We also present a method to compute the shape distances between two video objects based on these still shape features. Unlike in the case of still objects, one of the key features that describe a video object is motion. Classification of video objects by their local motion is addressed in this thesis by presenting three new motion descriptors. These descriptors are computed based on the shape deformations of arbitrarily shaped video, and assume no prior knowledge about the video content. Color is one of the most widely used low level features in content-based retrieval. In this thesis, we also study efficient color content matching of arbitrarily shaped video, and in particular, color histogram computation in the MPEG-4 compressed domain. Our experimental results demonstrate that our techniques enable effective and low complexity content-based retrieval. Employing MPEG-4 compressed domain information not only obviates the need for full decompression of the bit stream, hence yielding substantial computational savings, but also allows our techniques to be more robust to segmentation errors. === Applied Science, Faculty of === Electrical and Computer Engineering, Department of === Graduate
author	Erol, Berna
spellingShingle	Erol, Berna Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains
author_facet	Erol, Berna
author_sort	Erol, Berna
title	Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains
title_short	Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains
title_full	Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains
title_fullStr	Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains
title_full_unstemmed	Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains
title_sort	content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains
publishDate	2009
url	http://hdl.handle.net/2429/12773
work_keys_str_mv	AT erolberna contentbasedretrievalofarbitrarilyshapedvideoobjectsintheuncompressedandcompresseddomains
_version_	1718589182754947072

Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains

Similar Items