Estimating Camera Position based on Structure from Motion and Image Retrieval Techniques

碩士 === 元智大學 === 資訊工程學系 === 104 === Camera pose estimation is an important research topic in computer vision. It is a fundamental and crucial step in the applications of augmented reality. Knowing camera pose, we can edit the content of an image such as generating a virtual object in the image. Curre...

Full description

Bibliographic Details
Main Authors: Yu-Liang Chen, 陳瑜靚
Other Authors: K. Robert Lai
Format: Others
Language:zh-TW
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/3s496w
id ndltd-TW-104YZU05392011
record_format oai_dc
spelling ndltd-TW-104YZU053920112019-05-15T22:34:37Z http://ndltd.ncl.edu.tw/handle/3s496w Estimating Camera Position based on Structure from Motion and Image Retrieval Techniques 基於SFM和圖像檢索技術的攝影機定位研究 Yu-Liang Chen 陳瑜靚 碩士 元智大學 資訊工程學系 104 Camera pose estimation is an important research topic in computer vision. It is a fundamental and crucial step in the applications of augmented reality. Knowing camera pose, we can edit the content of an image such as generating a virtual object in the image. Currently, many camera pose estimation techniques have been developed focusing on small and indoor environment. For large and outdoor environments, camera pose estimation faces some problems such as poor applicability and high computational complexity. Due to this, in this thesis we develop a two-stage camera position estimation system for large outdoor environments where the SFM (Structure from Motion) and an image retrieval technique are integrated in the system. In the stage of SFM, The SURF features in each image are extracted and the 3D structure of the scene are reconstructed. After that, a matching table which maps 2D image features to corresponding 3D points is created for subsequent camera pose estimation. In the stage of image retrieval, a method with high retrieval precision, the CNN (convolutional neural network), is employed to extract similar images from the dataset. In addition, the locality sensitive hashing algorithm is also included in the system to improve the retrieval efficiency. The features in the query image are extracted and matched with the images returned by the retrieval technique. A set of 2D and 3D matching points is established from the matching table and the camera pose is estimated. A series of experiments are conducted and results are encouraging. The camera pose of a query image in a large outdoor environment can be accurately estimated which demonstrates the feasibility of developed system. K. Robert Lai Chin-Hung Teng 賴國華 鄧進宏 2016 學位論文 ; thesis 52 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 元智大學 === 資訊工程學系 === 104 === Camera pose estimation is an important research topic in computer vision. It is a fundamental and crucial step in the applications of augmented reality. Knowing camera pose, we can edit the content of an image such as generating a virtual object in the image. Currently, many camera pose estimation techniques have been developed focusing on small and indoor environment. For large and outdoor environments, camera pose estimation faces some problems such as poor applicability and high computational complexity. Due to this, in this thesis we develop a two-stage camera position estimation system for large outdoor environments where the SFM (Structure from Motion) and an image retrieval technique are integrated in the system. In the stage of SFM, The SURF features in each image are extracted and the 3D structure of the scene are reconstructed. After that, a matching table which maps 2D image features to corresponding 3D points is created for subsequent camera pose estimation. In the stage of image retrieval, a method with high retrieval precision, the CNN (convolutional neural network), is employed to extract similar images from the dataset. In addition, the locality sensitive hashing algorithm is also included in the system to improve the retrieval efficiency. The features in the query image are extracted and matched with the images returned by the retrieval technique. A set of 2D and 3D matching points is established from the matching table and the camera pose is estimated. A series of experiments are conducted and results are encouraging. The camera pose of a query image in a large outdoor environment can be accurately estimated which demonstrates the feasibility of developed system.
author2 K. Robert Lai
author_facet K. Robert Lai
Yu-Liang Chen
陳瑜靚
author Yu-Liang Chen
陳瑜靚
spellingShingle Yu-Liang Chen
陳瑜靚
Estimating Camera Position based on Structure from Motion and Image Retrieval Techniques
author_sort Yu-Liang Chen
title Estimating Camera Position based on Structure from Motion and Image Retrieval Techniques
title_short Estimating Camera Position based on Structure from Motion and Image Retrieval Techniques
title_full Estimating Camera Position based on Structure from Motion and Image Retrieval Techniques
title_fullStr Estimating Camera Position based on Structure from Motion and Image Retrieval Techniques
title_full_unstemmed Estimating Camera Position based on Structure from Motion and Image Retrieval Techniques
title_sort estimating camera position based on structure from motion and image retrieval techniques
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/3s496w
work_keys_str_mv AT yuliangchen estimatingcamerapositionbasedonstructurefrommotionandimageretrievaltechniques
AT chényújìng estimatingcamerapositionbasedonstructurefrommotionandimageretrievaltechniques
AT yuliangchen jīyúsfmhétúxiàngjiǎnsuǒjìshùdeshèyǐngjīdìngwèiyánjiū
AT chényújìng jīyúsfmhétúxiàngjiǎnsuǒjìshùdeshèyǐngjīdìngwèiyánjiū
_version_ 1719132991464144896