Document Image Segmentation System Based on Various Document Image Features


Bibliographic Details
Main Author: Kuan-Ying Huang (黃冠穎)
Other Authors: Chien-Ming Chou
Format: Others
Language: zh-TW
Published: 2005
Online Access: http://ndltd.ncl.edu.tw/handle/06815281515112952952
id ndltd-TW-093MDU07121011
record_format oai_dc
spelling ndltd-TW-093MDU07121011 2015-12-25T04:10:28Z http://ndltd.ncl.edu.tw/handle/06815281515112952952 Document Image Segmentation System Based on Various Document Image Features 整合各種文件特徵為基礎之文件影像分割系統 Kuan-Ying Huang 黃冠穎 Master's, Mingdao College of Management (明道管理學院), Graduate Institute of Management, 93 Chien-Ming Chou 周建明 2005 學位論文 ; thesis 63 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === Mingdao College of Management (明道管理學院) === Graduate Institute of Management === 93 === With the rapid development of the Internet, digital libraries have become increasingly important. According to Dr. Daniel Greenstein, digital libraries at the current stage tend to build digital collections and to integrate all kinds of digital data for delivery to users. Digitizing the holdings of traditional libraries is therefore an important problem, and Document Image Analysis (DIA) technology can accomplish it. Within DIA, document image segmentation is an important step: its goal is to separate background, text, and pictures in a document image and to recognize them. In this thesis, we propose a document image segmentation system based on many kinds of document image features. We present a reliable system for edge detection, localization, extraction, and binarization of text in document images. The system can extract background information, words, and pictures from document images of different colors. Because a document image combines many image features, such as background, text, and pictures, we employ several image feature extraction methods to extract them, including statistical characteristic measures, edge detection, projection, and the Gaussian mixture model. Experiments on an extensive set of document images demonstrate the effectiveness and superiority of the proposed method and show that the system performs well. Keywords: Digital library, Document image analysis, Document image segmentation, Color background, Feature extraction, Statistical measure, Edge detection, Projection, Gaussian mixture model.
author2 Chien-Ming Chou
author_facet Chien-Ming Chou
Kuan-Ying Huang
黃冠穎
author Kuan-Ying Huang
黃冠穎
publishDate 2005
url http://ndltd.ncl.edu.tw/handle/06815281515112952952
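
The abstract lists projection among the feature extraction methods but does not describe how the thesis applies it. As a rough illustration only, the sketch below shows the generic row projection profile technique on a tiny binarized image: foreground pixels are counted per row, and runs of non-empty rows are taken as candidate text bands. The function names `row_projection` and `find_bands`, the `threshold` parameter, and the sample `page` data are all assumptions made for this example; none of them come from the thesis.

```python
def row_projection(binary):
    """Count foreground pixels (value 1) in each row of a binary image."""
    return [sum(row) for row in binary]


def find_bands(profile, threshold=0):
    """Return (start, end) row ranges, end exclusive, where the profile
    stays above `threshold`. The threshold is a hypothetical tuning knob."""
    bands = []
    start = None
    for i, count in enumerate(profile):
        if count > threshold and start is None:
            start = i                    # a band of foreground rows begins
        elif count <= threshold and start is not None:
            bands.append((start, i))     # the band ended on the previous row
            start = None
    if start is not None:                # band runs to the bottom edge
        bands.append((start, len(profile)))
    return bands


# Hypothetical 6x6 binarized page: 1 = ink, 0 = background.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0],
]

profile = row_projection(page)
bands = find_bands(profile)
print(profile)
print(bands)
```

Applying the same counting to columns inside each band would give a vertical profile, which is one common way to split a band further into words or blocks. A real system would first need the binarization step that the abstract mentions, which this sketch takes as given.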