A Knowledge Document Retrieval Model using Semantic Analysis and Document Summarization Technologies

碩士 === 南華大學 === 資訊管理學系 === 100 ===   It is a common practice to acquire information and knowledge from the Internet; thus, keyword searching, document classification and other technologies have been developed to facilitate searching. Although the search engine sites can narrow down the scope of sea...

Full description

Bibliographic Details
Main Authors: Yu-ting Gong, 龔鈺婷
Other Authors: Shih-ting Yang
Format: Others
Language:zh-TW
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/30792117037217188421
Description
Summary:碩士 === 南華大學 === 資訊管理學系 === 100 ===   It is a common practice to acquire information and knowledge from the Internet; thus, keyword searching, document classification and other technologies have been developed to facilitate searching. Although the search engine sites can narrow down the scope of search, knowledge demanders without background knowledge in the specific fields need to continuously search and receive feedbacks. Hence, this paper develops a Knowledge Document Retrieval model using semantic analysis and document summarization technologies for domain knowledge documents. First, this paper analyzes the ergonomic technology reports from the website of “Institute of Occupational Safety and Health” to capture the expressions and related vocabulary of domain knowledge documents to develop the knowledge vocabulary database. Second, through the Question and Answer Analysis (QAA) module, the correlations between proper names and query strings can be obtained. Third, based on Conceptual Vocabulary Determination (CVD) module, the most conceptual or representative sentences of domain documents can be derived and serve as candidate sentences for structured summarization. Finally, the Document Structured Summarization (DSS) module is used to calculate and retrieve representative sentences of the documents and integrate them into summary for knowledge demanders. It is expected that knowledge demanders can directly read the desired parts according to problems to ensure they can find document they want within a short time. In order to demonstrate applicability of the proposed methodology, a web-based knowledge document retrieval system is also established based on the proposed model. Furthermore, the knowledge documents (i.e., ergonomic technology reports) from the website of “Institute of Occupational Safety and Health” are applied as examples to evaluate the proposed model. The verification results show that the developed system is a high-performance knowledge document retrieval system. As a whole, this research provides an approach for knowledge demanders to efficiently and accurately acquire the domain knowledge documents.