Approaching Google Ranking with Semantically Related Terms
碩士 === 元智大學 === 資訊管理學系 === 99 === This study aims to approximate Google ranking results using semantically related terms of query. Firstly, we crawled and extracted web page title, snippet and URL from Google search results. Then we found semantically related terms using Latent Semantic Analysis (LS...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/75652668585464020794 |
id |
ndltd-TW-099YZU05396071 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099YZU053960712016-04-13T04:17:16Z http://ndltd.ncl.edu.tw/handle/75652668585464020794 Approaching Google Ranking with Semantically Related Terms 運用語意相關詞來推估Google搜尋引擎的排名 Chun-Ju Li 李淳如 碩士 元智大學 資訊管理學系 99 This study aims to approximate Google ranking results using semantically related terms of query. Firstly, we crawled and extracted web page title, snippet and URL from Google search results. Then we found semantically related terms using Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) two approaches. Secondly we calculated the scores for keywords in title, keyword in snippet and keyword in URL for obtaining a document score. Several experiments were conducted on different combination of number of semantically related terms, number of documents, uni-gram and n-gram tokenization method, 1 topic and 2 topics of semantically related terms. The experimental results showed the average R-Precision reaches 0.8, indicating the ranking results of the proposed method approximates to Google results. Cheng-Jye Luh 陸承志 2011 學位論文 ; thesis 47 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 元智大學 === 資訊管理學系 === 99 === This study aims to approximate Google ranking results using semantically related terms of query. Firstly, we crawled and extracted web page title, snippet and URL from Google search results. Then we found semantically related terms using Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) two approaches. Secondly we calculated the scores for keywords in title, keyword in snippet and keyword in URL for obtaining a document score. Several experiments were conducted on different combination of number of semantically related terms, number of documents, uni-gram and n-gram tokenization method, 1 topic and 2 topics of semantically related terms. The experimental results showed the average R-Precision reaches 0.8, indicating the ranking results of the proposed method approximates to Google results.
|
author2 |
Cheng-Jye Luh |
author_facet |
Cheng-Jye Luh Chun-Ju Li 李淳如 |
author |
Chun-Ju Li 李淳如 |
spellingShingle |
Chun-Ju Li 李淳如 Approaching Google Ranking with Semantically Related Terms |
author_sort |
Chun-Ju Li |
title |
Approaching Google Ranking with Semantically Related Terms |
title_short |
Approaching Google Ranking with Semantically Related Terms |
title_full |
Approaching Google Ranking with Semantically Related Terms |
title_fullStr |
Approaching Google Ranking with Semantically Related Terms |
title_full_unstemmed |
Approaching Google Ranking with Semantically Related Terms |
title_sort |
approaching google ranking with semantically related terms |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/75652668585464020794 |
work_keys_str_mv |
AT chunjuli approachinggooglerankingwithsemanticallyrelatedterms AT lǐchúnrú approachinggooglerankingwithsemanticallyrelatedterms AT chunjuli yùnyòngyǔyìxiāngguāncíláituīgūgooglesōuxúnyǐnqíngdepáimíng AT lǐchúnrú yùnyòngyǔyìxiāngguāncíláituīgūgooglesōuxúnyǐnqíngdepáimíng |
_version_ |
1718222595111780352 |