Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System

Presentation slides are now a de facto standard in most classroom lectures, business meetings, and conference talks. Until recently, electronic presentation materials have been disjointed from each other: the video file and the corresponding slides are typically available separately for viewing or d...

Full description

Bibliographic Details
Main Author:	Kharitonova, Yekaterina
Other Authors:	Barnard, Kobus
Language:	en_US
Published:	The University of Arizona. 2016
Subjects:	Computer Science
Online Access:	http://hdl.handle.net/10150/612447 http://arizona.openrepository.com/arizona/handle/10150/612447

id	ndltd-arizona.edu-oai-arizona.openrepository.com-10150-612447
record_format	oai_dc
spelling	ndltd-arizona.edu-oai-arizona.openrepository.com-10150-6124472016-06-11T15:01:30Z Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System Kharitonova, Yekaterina Barnard, Kobus Efrat, Alon Morrison, Clayton T. Surdeanu, Mihai Barnard, Kobus Computer Science Presentation slides are now a de facto standard in most classroom lectures, business meetings, and conference talks. Until recently, electronic presentation materials have been disjointed from each other: the video file and the corresponding slides are typically available separately for viewing or download. In this work, we exploit the fact that video frames of a presentation and the corresponding slides are mapped into one another by a geometric transformation, called a homography. This mapping allows us to synchronize a video with the slides shown in it, enabling users to interactively view presentation materials, and search within and across presentations. We show how we can approximate homographies with affine transformations. Similarly to the original homographies, such transformations allow us to project slides back into the video (i.e., perform backprojection), which improves their resulting appearance. The advantage of our method is that we use homographies to compress the original video, reducing bandwidth used to transmit the video file, and then carry out backprojection using affine transformations on the client side. Additionally, we introduce a novel approach to slide appearance approximation, which improves SIFT-based matching for videos with out-of-plane rotation of the projection screen. This method also allows us to split each slide into three overlapping panels, and generate rotated versions of each such panel. Using these panels during matching, we detect slide's content that is projected onto a speaker (what we call "slide tattoos"). We treat these "tattoos" as implicit structured light, which provides hints about the scene geometry. We then use the homography obtained from detecting "slide tattoos" to compute a fundamental matrix. The main significance of this contribution is that it allows us to infer 3-D information from 2-D presentation materials. Finally, we present the Semantically Linked Instructional Content (SLIC) Portal, an online system for accessing presentations that exploits our slide-video matching. Aspects of the SLIC system fully developed or significantly improved as part of this work include: a publicly-open web collection of video presentations indexed by slides a unified clear interface displaying a video player along with slide images synchronized with their appearance in the video a categorization tree that allows browsing for presentations by topic/category an ability to query slide words within and across presentations; querying is integrated with the"browsing"mode, where the search results can be narrowed to only the selected categories an easy integration with the audio transcript: the ability to preview and search within speech words cross-platform and mobile support. We conducted user studies at the University of Arizona to measure the effect of synchronized presentation materials on learners, and discuss students' favorable response to the SLIC Portal, which they used during the experiments. 2016 text Electronic Dissertation http://hdl.handle.net/10150/612447 http://arizona.openrepository.com/arizona/handle/10150/612447 en_US Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author. The University of Arizona.
collection	NDLTD
language	en_US
sources	NDLTD
topic	Computer Science
spellingShingle	Computer Science Kharitonova, Yekaterina Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System
description	Presentation slides are now a de facto standard in most classroom lectures, business meetings, and conference talks. Until recently, electronic presentation materials have been disjointed from each other: the video file and the corresponding slides are typically available separately for viewing or download. In this work, we exploit the fact that video frames of a presentation and the corresponding slides are mapped into one another by a geometric transformation, called a homography. This mapping allows us to synchronize a video with the slides shown in it, enabling users to interactively view presentation materials, and search within and across presentations. We show how we can approximate homographies with affine transformations. Similarly to the original homographies, such transformations allow us to project slides back into the video (i.e., perform backprojection), which improves their resulting appearance. The advantage of our method is that we use homographies to compress the original video, reducing bandwidth used to transmit the video file, and then carry out backprojection using affine transformations on the client side. Additionally, we introduce a novel approach to slide appearance approximation, which improves SIFT-based matching for videos with out-of-plane rotation of the projection screen. This method also allows us to split each slide into three overlapping panels, and generate rotated versions of each such panel. Using these panels during matching, we detect slide's content that is projected onto a speaker (what we call "slide tattoos"). We treat these "tattoos" as implicit structured light, which provides hints about the scene geometry. We then use the homography obtained from detecting "slide tattoos" to compute a fundamental matrix. The main significance of this contribution is that it allows us to infer 3-D information from 2-D presentation materials. Finally, we present the Semantically Linked Instructional Content (SLIC) Portal, an online system for accessing presentations that exploits our slide-video matching. Aspects of the SLIC system fully developed or significantly improved as part of this work include: a publicly-open web collection of video presentations indexed by slides a unified clear interface displaying a video player along with slide images synchronized with their appearance in the video a categorization tree that allows browsing for presentations by topic/category an ability to query slide words within and across presentations; querying is integrated with the"browsing"mode, where the search results can be narrowed to only the selected categories an easy integration with the audio transcript: the ability to preview and search within speech words cross-platform and mobile support. We conducted user studies at the University of Arizona to measure the effect of synchronized presentation materials on learners, and discuss students' favorable response to the SLIC Portal, which they used during the experiments.
author2	Barnard, Kobus
author_facet	Barnard, Kobus Kharitonova, Yekaterina
author	Kharitonova, Yekaterina
author_sort	Kharitonova, Yekaterina
title	Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System
title_short	Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System
title_full	Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System
title_fullStr	Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System
title_full_unstemmed	Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System
title_sort	geometry of presentation videos and slides, and the semantic linking of instructional content (slic) system
publisher	The University of Arizona.
publishDate	2016
url	http://hdl.handle.net/10150/612447 http://arizona.openrepository.com/arizona/handle/10150/612447
work_keys_str_mv	AT kharitonovayekaterina geometryofpresentationvideosandslidesandthesemanticlinkingofinstructionalcontentslicsystem
_version_	1718302444605145088

Geometry of Presentation Videos and Slides, and the Semantic Linking of Instructional Content (SLIC) System

Similar Items