Summary: | 碩士 === 元智大學 === 資訊工程學系 === 104 === Camera pose estimation is an important research topic in computer vision. It is a fundamental and crucial step in the applications of augmented reality. Knowing camera pose, we can edit the content of an image such as generating a virtual object in the image. Currently, many camera pose estimation techniques have been developed focusing on small and indoor environment. For large and outdoor environments, camera pose estimation faces some problems such as poor applicability and high computational complexity. Due to this, in this thesis we develop a two-stage camera position estimation system for large outdoor environments where the SFM (Structure from Motion) and an image retrieval technique are integrated in the system. In the stage of SFM, The SURF features in each image are extracted and the 3D structure of the scene are reconstructed. After that, a matching table which maps 2D image features to corresponding 3D points is created for subsequent camera pose estimation. In the stage of image retrieval, a method with high retrieval precision, the CNN (convolutional neural network), is employed to extract similar images from the dataset. In addition, the locality sensitive hashing algorithm is also included in the system to improve the retrieval efficiency. The features in the query image are extracted and matched with the images returned by the retrieval technique. A set of 2D and 3D matching points is established from the matching table and the camera pose is estimated. A series of experiments are conducted and results are encouraging. The camera pose of a query image in a large outdoor environment can be accurately estimated which demonstrates the feasibility of developed system.
|