Document Image Segmentation System Based on Various Document Image Features
Master's === 明道管理學院 === Graduate Institute of Management === 93 === With the rapid development of the Internet, digital libraries are becoming increasingly important. According to Dr. Daniel Greenstein, digital libraries at the current stage tend to develop digital collections and integrate all kinds of d...
Main Authors: | Kuan-Ying Huang, 黃冠穎 |
---|---|
Other Authors: | Chien-Ming Chou |
Format: | Others |
Language: | zh-TW |
Published: | 2005 |
Online Access: | http://ndltd.ncl.edu.tw/handle/06815281515112952952 |
id |
ndltd-TW-093MDU07121011 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093MDU07121011 2015-12-25T04:10:28Z http://ndltd.ncl.edu.tw/handle/06815281515112952952 Document Image Segmentation System Based on Various Document Image Features 整合各種文件特徵為基礎之文件影像分割系統 Kuan-Ying Huang 黃冠穎 Master's 明道管理學院 Graduate Institute of Management 93 Chien-Ming Chou 周建明 2005 學位論文 ; thesis 63 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === 明道管理學院 === Graduate Institute of Management === 93 === With the rapid development of the Internet, digital libraries are becoming increasingly important. According to Dr. Daniel Greenstein, digital libraries at the current stage tend to develop digital collections and to integrate all kinds of digital data for their users. Digitizing the holdings of traditional libraries is therefore an important problem, and Document Image Analysis (DIA) technology can address it. Within DIA, document image segmentation is an important step: its goal is to separate the background, text, and pictures in a document image and to recognize them.
In this thesis, we propose a document image segmentation system based on several kinds of document image features. We present a reliable system for edge detection, localization, extraction, and binarization of text in document images. The system can extract background information, words, and pictures from document images of different colors. Because a document image combines many image features, such as background, text, and pictures, we employ several feature extraction methods to extract them, including statistical characteristic measures, edge detection, projection, and Gaussian mixture models.
Experimental results on an extensive set of document images demonstrate the effectiveness and superiority of the proposed method and show that the system performs well.
Keywords: Digital library, Document image analysis, Document image segmentation, Color background, Feature extraction, Statistical measure, Edge detection, Projection, Gaussian mixture model.
|
author2 |
Chien-Ming Chou |
author_facet |
Chien-Ming Chou Kuan-Ying Huang 黃冠穎 |
author |
Kuan-Ying Huang 黃冠穎 |
spellingShingle |
Kuan-Ying Huang 黃冠穎 Document Image Segmentation System Based on Various Document Image Features |
author_sort |
Kuan-Ying Huang |
title |
Document Image Segmentation System Based on Various Document Image Features |
publishDate |
2005 |
url |
http://ndltd.ncl.edu.tw/handle/06815281515112952952 |