The perspective rectification for non-planar documents
碩士 === 國立中央大學 === 資訊工程研究所 === 99 === Recently, digital cameras become a universal device due to its cost down. People can capture images at will in any time. In the past, text images can only be acquired by scanning documents using scanners. Currently, we can obtain the images of any kinds of object...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/66491614538096707522 |
id |
ndltd-TW-099NCU05392096 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099NCU053920962017-07-14T04:27:43Z http://ndltd.ncl.edu.tw/handle/66491614538096707522 The perspective rectification for non-planar documents 非共平面文件影像透視矯正 Tsung-Hsien Wu 吳宗憲 碩士 國立中央大學 資訊工程研究所 99 Recently, digital cameras become a universal device due to its cost down. People can capture images at will in any time. In the past, text images can only be acquired by scanning documents using scanners. Currently, we can obtain the images of any kinds of objects by simply using cameras within second. Resulting from the influences of environments existing in our daily life while capturing the object images, some new research topics arise, such as the space with uneven light-illumination and scene with more than one plane. Those new problems will definitely affect the performance of OCR (optical character recognition) drastically. Instead of focusing on OCR study which can already correctly recognize a single character with very high recognition rate currently, we devoting ourselves on slicing and obtaining a character from the images captured under poor conditions. For instance, the difference in view angles and texts do not distribute on the same plane. In this thesis, the research focuses on rectifying the documents with perspective distortions caused by different view angles while capturing images and various planes that texts locate on the image. In our work, we specially focus on classifying and rectifying images resided on cylinders, cubes of non-coplanar, and those captured through non-vertical light axis lens. This study provides an effective way in splitting an image with different planes and rectifying the split regions. The effective method in rectifying documents is designed mainly by using image processing techniques, such as connected-component labeling for extracting image information, text line extraction water flow algorithm, and image plane analysis. The proposed method can rectify those common document images with perspective distortion caused by non-singular planes without needing the information of document border and typesetting. Experimental results verify the feasibility and validity of our proposed method. Kuo-Chin Fan 范國清 2011 學位論文 ; thesis 94 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中央大學 === 資訊工程研究所 === 99 === Recently, digital cameras become a universal device due to its cost down. People can capture images at will in any time. In the past, text images can only be acquired by scanning documents using scanners. Currently, we can obtain the images of any kinds of objects by simply using cameras within second. Resulting from the influences of environments existing in our daily life while capturing the object images, some new research topics arise, such as the space with uneven light-illumination and scene with more than one plane. Those new problems will definitely affect the performance of OCR (optical character recognition) drastically.
Instead of focusing on OCR study which can already correctly recognize a single character with very high recognition rate currently, we devoting ourselves on slicing and obtaining a character from the images captured under poor conditions. For instance, the difference in view angles and texts do not distribute on the same plane. In this thesis, the research focuses on rectifying the documents with perspective distortions caused by different view angles while capturing images and various planes that texts locate on the image. In our work, we specially focus on classifying and rectifying images resided on cylinders, cubes of non-coplanar, and those captured through non-vertical light axis lens.
This study provides an effective way in splitting an image with different planes and rectifying the split regions. The effective method in rectifying documents is designed mainly by using image processing techniques, such as connected-component labeling for extracting image information, text line extraction water flow algorithm, and image plane analysis. The proposed method can rectify those common document images with perspective distortion caused by non-singular planes without needing the information of document border and typesetting. Experimental results verify the feasibility and validity of our proposed method.
|
author2 |
Kuo-Chin Fan |
author_facet |
Kuo-Chin Fan Tsung-Hsien Wu 吳宗憲 |
author |
Tsung-Hsien Wu 吳宗憲 |
spellingShingle |
Tsung-Hsien Wu 吳宗憲 The perspective rectification for non-planar documents |
author_sort |
Tsung-Hsien Wu |
title |
The perspective rectification for non-planar documents |
title_short |
The perspective rectification for non-planar documents |
title_full |
The perspective rectification for non-planar documents |
title_fullStr |
The perspective rectification for non-planar documents |
title_full_unstemmed |
The perspective rectification for non-planar documents |
title_sort |
perspective rectification for non-planar documents |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/66491614538096707522 |
work_keys_str_mv |
AT tsunghsienwu theperspectiverectificationfornonplanardocuments AT wúzōngxiàn theperspectiverectificationfornonplanardocuments AT tsunghsienwu fēigòngpíngmiànwénjiànyǐngxiàngtòushìjiǎozhèng AT wúzōngxiàn fēigòngpíngmiànwénjiànyǐngxiàngtòushìjiǎozhèng AT tsunghsienwu perspectiverectificationfornonplanardocuments AT wúzōngxiàn perspectiverectificationfornonplanardocuments |
_version_ |
1718495986915999744 |