Algorithm and Architecture Design of 3D Interactive User Interface by Stereo Camera

碩士 === 國立臺灣大學 === 電子工程學研究所 === 101 === Digital video technology has played an important role in our daily life. With the evolution of the display technologies, display systems can provide higher visual quality to enrich human life. Immersive 3D displays provide better visual experience than convent...

Full description

Bibliographic Details
Main Authors: Cheng-Yuan Ko, 柯政遠
Other Authors: Liang-Gee Chen
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/51761706508335148782
id ndltd-TW-101NTU05428128
record_format oai_dc
spelling ndltd-TW-101NTU054281282015-10-13T23:10:18Z http://ndltd.ncl.edu.tw/handle/51761706508335148782 Algorithm and Architecture Design of 3D Interactive User Interface by Stereo Camera 利用立體相機之三維互動使用者介面之演算法與硬體架構設計 Cheng-Yuan Ko 柯政遠 碩士 國立臺灣大學 電子工程學研究所 101 Digital video technology has played an important role in our daily life. With the evolution of the display technologies, display systems can provide higher visual quality to enrich human life. Immersive 3D displays provide better visual experience than conventional 2D displays. 3D technology enriches the contents of many applications, such as broadcasting, movie, gaming, photographing, camcorder, education, etc. However, in the case of stereoscopic display is quite mature and image is quite realistic, the user will want to interact with three dimensional virtual objects, such as slapping, sliding, throwing…. In this thesis, we proposed “virtual touch” interaction by using stereo camera. Common interactive way is that user can do some hand gesture or body gesture in front of TV or other devices, and then the system recognizes the gesture and some reaction which is corresponded to this gesture will be appeared. This kind of research is already quite mature, and its function more likes the remote control. Nowadays, in the case of stereoscopic display is quite mature and image is quite realistic, the user will want to interact with three dimensional virtual objects, the so-called "virtual touch" is such as slapping, sliding, throwing…. We proposed a 3D interactive user interface by stereo camera which can detect the user''s hand and body''s location. When the position of user’s hand and position of virtual object are consistent, then the system considers that the user achieve the “virtual touch”, and then the system will recognize the user’s operation, and therefore give the user a so-called "virtual touch" interaction. The 3D interactive user interface by stereo camera is discussed in two different parts: distance estimation by calibration-free captures and 3D hand localization by using belief propagation. The distance estimation by calibration-free captures is the first step of 3D interactive user interface. The main concept is that treats that user as an object, and from the left capture and right capture from stereo camera, calculates the disparity of the user. Finally, the user’s distance can be estimated by disparity of the user. 3D hand localization by using belief propagation is another part of interactive 3D user interface. When we only have the user’s distance from system, we can just do some simple interaction with system. Because of hand gesture is one of the most intuitive and nature ways for people to communicate with machines, so system have to get the user’s hand 3D localization, and thus the user can do more complex control or interaction with system. We use only depth and color information to get the hand’s 3D localization and do some simple gesture recognition to judge the reaction. We also proposed 3-stage architecture for hardware design, and the implementation result shows that the architecture can achieve real-time interaction of input Fll-HD1080p@30fps stereo images when operating at 200MHz. Liang-Gee Chen 陳良基 2013 學位論文 ; thesis 88 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 電子工程學研究所 === 101 === Digital video technology has played an important role in our daily life. With the evolution of the display technologies, display systems can provide higher visual quality to enrich human life. Immersive 3D displays provide better visual experience than conventional 2D displays. 3D technology enriches the contents of many applications, such as broadcasting, movie, gaming, photographing, camcorder, education, etc. However, in the case of stereoscopic display is quite mature and image is quite realistic, the user will want to interact with three dimensional virtual objects, such as slapping, sliding, throwing…. In this thesis, we proposed “virtual touch” interaction by using stereo camera. Common interactive way is that user can do some hand gesture or body gesture in front of TV or other devices, and then the system recognizes the gesture and some reaction which is corresponded to this gesture will be appeared. This kind of research is already quite mature, and its function more likes the remote control. Nowadays, in the case of stereoscopic display is quite mature and image is quite realistic, the user will want to interact with three dimensional virtual objects, the so-called "virtual touch" is such as slapping, sliding, throwing…. We proposed a 3D interactive user interface by stereo camera which can detect the user''s hand and body''s location. When the position of user’s hand and position of virtual object are consistent, then the system considers that the user achieve the “virtual touch”, and then the system will recognize the user’s operation, and therefore give the user a so-called "virtual touch" interaction. The 3D interactive user interface by stereo camera is discussed in two different parts: distance estimation by calibration-free captures and 3D hand localization by using belief propagation. The distance estimation by calibration-free captures is the first step of 3D interactive user interface. The main concept is that treats that user as an object, and from the left capture and right capture from stereo camera, calculates the disparity of the user. Finally, the user’s distance can be estimated by disparity of the user. 3D hand localization by using belief propagation is another part of interactive 3D user interface. When we only have the user’s distance from system, we can just do some simple interaction with system. Because of hand gesture is one of the most intuitive and nature ways for people to communicate with machines, so system have to get the user’s hand 3D localization, and thus the user can do more complex control or interaction with system. We use only depth and color information to get the hand’s 3D localization and do some simple gesture recognition to judge the reaction. We also proposed 3-stage architecture for hardware design, and the implementation result shows that the architecture can achieve real-time interaction of input Fll-HD1080p@30fps stereo images when operating at 200MHz.
author2 Liang-Gee Chen
author_facet Liang-Gee Chen
Cheng-Yuan Ko
柯政遠
author Cheng-Yuan Ko
柯政遠
spellingShingle Cheng-Yuan Ko
柯政遠
Algorithm and Architecture Design of 3D Interactive User Interface by Stereo Camera
author_sort Cheng-Yuan Ko
title Algorithm and Architecture Design of 3D Interactive User Interface by Stereo Camera
title_short Algorithm and Architecture Design of 3D Interactive User Interface by Stereo Camera
title_full Algorithm and Architecture Design of 3D Interactive User Interface by Stereo Camera
title_fullStr Algorithm and Architecture Design of 3D Interactive User Interface by Stereo Camera
title_full_unstemmed Algorithm and Architecture Design of 3D Interactive User Interface by Stereo Camera
title_sort algorithm and architecture design of 3d interactive user interface by stereo camera
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/51761706508335148782
work_keys_str_mv AT chengyuanko algorithmandarchitecturedesignof3dinteractiveuserinterfacebystereocamera
AT kēzhèngyuǎn algorithmandarchitecturedesignof3dinteractiveuserinterfacebystereocamera
AT chengyuanko lìyònglìtǐxiāngjīzhīsānwéihùdòngshǐyòngzhějièmiànzhīyǎnsuànfǎyǔyìngtǐjiàgòushèjì
AT kēzhèngyuǎn lìyònglìtǐxiāngjīzhīsānwéihùdòngshǐyòngzhějièmiànzhīyǎnsuànfǎyǔyìngtǐjiàgòushèjì
_version_ 1718084438835855360