Approaching Google Ranking with Semantically Related Terms

碩士 === 元智大學 === 資訊管理學系 === 99 === This study aims to approximate Google ranking results using semantically related terms of query. Firstly, we crawled and extracted web page title, snippet and URL from Google search results. Then we found semantically related terms using Latent Semantic Analysis (LS...

Full description

Bibliographic Details
Main Authors: Chun-Ju Li, 李淳如
Other Authors: Cheng-Jye Luh
Format: Others
Language:zh-TW
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/75652668585464020794
id ndltd-TW-099YZU05396071
record_format oai_dc
spelling ndltd-TW-099YZU053960712016-04-13T04:17:16Z http://ndltd.ncl.edu.tw/handle/75652668585464020794 Approaching Google Ranking with Semantically Related Terms 運用語意相關詞來推估Google搜尋引擎的排名 Chun-Ju Li 李淳如 碩士 元智大學 資訊管理學系 99 This study aims to approximate Google ranking results using semantically related terms of query. Firstly, we crawled and extracted web page title, snippet and URL from Google search results. Then we found semantically related terms using Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) two approaches. Secondly we calculated the scores for keywords in title, keyword in snippet and keyword in URL for obtaining a document score. Several experiments were conducted on different combination of number of semantically related terms, number of documents, uni-gram and n-gram tokenization method, 1 topic and 2 topics of semantically related terms. The experimental results showed the average R-Precision reaches 0.8, indicating the ranking results of the proposed method approximates to Google results. Cheng-Jye Luh 陸承志 2011 學位論文 ; thesis 47 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 元智大學 === 資訊管理學系 === 99 === This study aims to approximate Google ranking results using semantically related terms of query. Firstly, we crawled and extracted web page title, snippet and URL from Google search results. Then we found semantically related terms using Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) two approaches. Secondly we calculated the scores for keywords in title, keyword in snippet and keyword in URL for obtaining a document score. Several experiments were conducted on different combination of number of semantically related terms, number of documents, uni-gram and n-gram tokenization method, 1 topic and 2 topics of semantically related terms. The experimental results showed the average R-Precision reaches 0.8, indicating the ranking results of the proposed method approximates to Google results.
author2 Cheng-Jye Luh
author_facet Cheng-Jye Luh
Chun-Ju Li
李淳如
author Chun-Ju Li
李淳如
spellingShingle Chun-Ju Li
李淳如
Approaching Google Ranking with Semantically Related Terms
author_sort Chun-Ju Li
title Approaching Google Ranking with Semantically Related Terms
title_short Approaching Google Ranking with Semantically Related Terms
title_full Approaching Google Ranking with Semantically Related Terms
title_fullStr Approaching Google Ranking with Semantically Related Terms
title_full_unstemmed Approaching Google Ranking with Semantically Related Terms
title_sort approaching google ranking with semantically related terms
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/75652668585464020794
work_keys_str_mv AT chunjuli approachinggooglerankingwithsemanticallyrelatedterms
AT lǐchúnrú approachinggooglerankingwithsemanticallyrelatedterms
AT chunjuli yùnyòngyǔyìxiāngguāncíláituīgūgooglesōuxúnyǐnqíngdepáimíng
AT lǐchúnrú yùnyòngyǔyìxiāngguāncíláituīgūgooglesōuxúnyǐnqíngdepáimíng
_version_ 1718222595111780352