Term Association-based Hypertext Information Retrieval
碩士 === 中原大學 === 資訊工程學系 === 88 === Information retrievals, such as Boolean, vector and probability models, use index terms to get the documents of related contents. However, the relationships among index terms are usually ignored. Such relationships, no matter whether semantic or quantitative, may...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2000
|
Online Access: | http://ndltd.ncl.edu.tw/handle/24024970252713456332 |
id |
ndltd-TW-088CYCU0392020 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-088CYCU03920202015-10-13T11:53:30Z http://ndltd.ncl.edu.tw/handle/24024970252713456332 Term Association-based Hypertext Information Retrieval 以關鍵字相關性為基礎之超本文資訊檢索系統 Shao-Chun Li 李紹群 碩士 中原大學 資訊工程學系 88 Information retrievals, such as Boolean, vector and probability models, use index terms to get the documents of related contents. However, the relationships among index terms are usually ignored. Such relationships, no matter whether semantic or quantitative, may be valuable within retrievals. This paper utilized the quantitative associations of index terms to form a weighted undirected graph, called TAG (term association graph). A TAG can be obtained from the association rules of a structured document. A structured document consisting of structures such as chapter, section, heading and hyperlink can also be represented as an undirected graph, called Structure Document Graph (SDG). For any document, the index term content can be obtained from its SDG. Then the corresponding TAG for this document can be calculated under the criterion of minimal accumulated association. When applied to hypertext information retrieval, TAG can be used to find the similarities of index terms, then the similarity between query and document. A term association-based hypertext information retrieval system is established to implement our idea. The web documents are translated into SDGs; subsequently, the corresponding TAGs can be found and then clustered in several groups. Real examples prove that such system can retrieve related documents successfully. Jia-Sheng Heh 賀嘉生 2000 學位論文 ; thesis 39 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 中原大學 === 資訊工程學系 === 88 === Information retrievals, such as Boolean, vector and probability models, use index terms to get the documents of related contents. However, the relationships among index terms are usually ignored. Such relationships, no matter whether semantic or quantitative, may be valuable within retrievals.
This paper utilized the quantitative associations of index terms to form a weighted undirected graph, called TAG (term association graph). A TAG can be obtained from the association rules of a structured document. A structured document consisting of structures such as chapter, section, heading and hyperlink can also be represented as an undirected graph, called Structure Document Graph (SDG). For any document, the index term content can be obtained from its SDG. Then the corresponding TAG for this document can be calculated under the criterion of minimal accumulated association. When applied to hypertext information retrieval, TAG can be used to find the similarities of index terms, then the similarity between query and document.
A term association-based hypertext information retrieval system is established to implement our idea. The web documents are translated into SDGs; subsequently, the corresponding TAGs can be found and then clustered in several groups. Real examples prove that such system can retrieve related documents successfully.
|
author2 |
Jia-Sheng Heh |
author_facet |
Jia-Sheng Heh Shao-Chun Li 李紹群 |
author |
Shao-Chun Li 李紹群 |
spellingShingle |
Shao-Chun Li 李紹群 Term Association-based Hypertext Information Retrieval |
author_sort |
Shao-Chun Li |
title |
Term Association-based Hypertext Information Retrieval |
title_short |
Term Association-based Hypertext Information Retrieval |
title_full |
Term Association-based Hypertext Information Retrieval |
title_fullStr |
Term Association-based Hypertext Information Retrieval |
title_full_unstemmed |
Term Association-based Hypertext Information Retrieval |
title_sort |
term association-based hypertext information retrieval |
publishDate |
2000 |
url |
http://ndltd.ncl.edu.tw/handle/24024970252713456332 |
work_keys_str_mv |
AT shaochunli termassociationbasedhypertextinformationretrieval AT lǐshàoqún termassociationbasedhypertextinformationretrieval AT shaochunli yǐguānjiànzìxiāngguānxìngwèijīchǔzhīchāoběnwénzīxùnjiǎnsuǒxìtǒng AT lǐshàoqún yǐguānjiànzìxiāngguānxìngwèijīchǔzhīchāoběnwénzīxùnjiǎnsuǒxìtǒng |
_version_ |
1716849549789626368 |