Summary: | 碩士 === 國立交通大學 === 多媒體工程研究所 === 105 === This thesis presents an automatic method to label different orientations of building façades in a single image and infer its 3D structure from these geometric labels. While most of the existing works use Manhattan World assumption, our method takes a more general assumption which agrees more than three vanishing points in an image. There are two stages in the proposed algorithm. First, we estimate the coarse orientation map from vanishing lines. To find optimal orientation labels, we propose a multi-cue optimization and consider line segments, texture entropy, SIFT features of an image. Second, we segment the orientation map to irregular polygon patches and recover the 3D scene according to physics-based criteria. We propose a physics-inspired objective function to evaluate the results of 3D structures. The preliminary experiments demonstrate that the proposed method can deal with complicated façades of architectures.
|