Matching Slides to Presentation Videos

Video streaming is becoming a major channel for distance learning (or e-learning). A tremendous number of videos for educational purpose are capturedand archived in various e-learning systems today throughout schools, corporations and over the Internet. However, making information searchable and br...

Full description

Bibliographic Details
Main Author: Fan, Quanfu
Other Authors: Barnard, Kobus
Language:EN
Published: The University of Arizona. 2008
Subjects:
Online Access:http://hdl.handle.net/10150/195757
id ndltd-arizona.edu-oai-arizona.openrepository.com-10150-195757
record_format oai_dc
spelling ndltd-arizona.edu-oai-arizona.openrepository.com-10150-1957572015-10-23T04:43:09Z Matching Slides to Presentation Videos Fan, Quanfu Barnard, Kobus Barnard, Kobus Amir, Arnon Efrat, Alon Moon, Bongki computer vision video browsing indexing slides Video streaming is becoming a major channel for distance learning (or e-learning). A tremendous number of videos for educational purpose are capturedand archived in various e-learning systems today throughout schools, corporations and over the Internet. However, making information searchable and browsable, and presenting results optimally for a wide range of users and systems, remains a challenge.In this work two core algorithms have been developedto support effective browsing and searching of educational videos. The first is a fully automatic approach that recognizes slides in the videowith high accuracy. Built upon SIFT (scale invariant feature transformation) keypoint matching using RANSAC (random sample consensus), the approach is independent of capture systems and can handle a variety of videos with different styles and plentiful ambiguities. In particular, we propose a multi-phase matching pipeline that incrementally identifies slides from the easy ones to the difficult ones. We achieve further robustness by using the matching confidence as part of a dynamic Hidden Markov model (HMM) that integrates temporal information, taking camera operations into account as well.The second algorithm locates slides in the video. We develop a non-linear optimization method (bundle adjustment) to accurately estimate the projective transformations (homographies) between slides and video frames. Different from estimating homography from a single image, our method solves a set of homographies jointly in a frame sequence that is related to a single slide.These two algorithms open up a series of possibilities for making the video content more searchable, browsable and understandable, thus greatly enriching the user's learning experience. Their usefulness has been demonstrated in the SLIC (Semantically Linking Instructional Content) system, which aims to turnsimple video content into fully interactive learning experience for students and scholars. 2008 text Electronic Dissertation http://hdl.handle.net/10150/195757 659749595 2597 EN Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author. The University of Arizona.
collection NDLTD
language EN
sources NDLTD
topic computer vision
video browsing
indexing
slides
spellingShingle computer vision
video browsing
indexing
slides
Fan, Quanfu
Matching Slides to Presentation Videos
description Video streaming is becoming a major channel for distance learning (or e-learning). A tremendous number of videos for educational purpose are capturedand archived in various e-learning systems today throughout schools, corporations and over the Internet. However, making information searchable and browsable, and presenting results optimally for a wide range of users and systems, remains a challenge.In this work two core algorithms have been developedto support effective browsing and searching of educational videos. The first is a fully automatic approach that recognizes slides in the videowith high accuracy. Built upon SIFT (scale invariant feature transformation) keypoint matching using RANSAC (random sample consensus), the approach is independent of capture systems and can handle a variety of videos with different styles and plentiful ambiguities. In particular, we propose a multi-phase matching pipeline that incrementally identifies slides from the easy ones to the difficult ones. We achieve further robustness by using the matching confidence as part of a dynamic Hidden Markov model (HMM) that integrates temporal information, taking camera operations into account as well.The second algorithm locates slides in the video. We develop a non-linear optimization method (bundle adjustment) to accurately estimate the projective transformations (homographies) between slides and video frames. Different from estimating homography from a single image, our method solves a set of homographies jointly in a frame sequence that is related to a single slide.These two algorithms open up a series of possibilities for making the video content more searchable, browsable and understandable, thus greatly enriching the user's learning experience. Their usefulness has been demonstrated in the SLIC (Semantically Linking Instructional Content) system, which aims to turnsimple video content into fully interactive learning experience for students and scholars.
author2 Barnard, Kobus
author_facet Barnard, Kobus
Fan, Quanfu
author Fan, Quanfu
author_sort Fan, Quanfu
title Matching Slides to Presentation Videos
title_short Matching Slides to Presentation Videos
title_full Matching Slides to Presentation Videos
title_fullStr Matching Slides to Presentation Videos
title_full_unstemmed Matching Slides to Presentation Videos
title_sort matching slides to presentation videos
publisher The University of Arizona.
publishDate 2008
url http://hdl.handle.net/10150/195757
work_keys_str_mv AT fanquanfu matchingslidestopresentationvideos
_version_ 1718099664759160832