Applying Text Mining Techniques to Extract Keywords of Homestay Articles
碩士 === 輔仁大學 === 資訊管理學系 === 98 === In recent years, because of the number of small family increases, the travel trends have changed, and the government vigorously promotes the homestay industry, so that the number of homestay has increased year by year, but compared to the standard hotel-style servic...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/70442823355506415150 |
id |
ndltd-TW-098FJU00396041 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098FJU003960412015-10-13T18:16:46Z http://ndltd.ncl.edu.tw/handle/70442823355506415150 Applying Text Mining Techniques to Extract Keywords of Homestay Articles 利用文字探勘技術萃取民宿文章關鍵字 Ko,Wei-Ting 柯威廷 碩士 輔仁大學 資訊管理學系 98 In recent years, because of the number of small family increases, the travel trends have changed, and the government vigorously promotes the homestay industry, so that the number of homestay has increased year by year, but compared to the standard hotel-style services, homestay industry is approachable and has a friendly way to get along for ease of modern life so that people can pursue a higher degree of leisure life. As the shortage of information of accommodation sources, information is not unilaterally offered by the homestay industry but is dependent on tourists who wrote online articles to deliver the information. How to filter out the information to represent homestay industry would be a worthwhile research issues. In this study, latent semantic analysis (LSA) and singular value decomposition (SVD) are used to extract keywords to classify the homestay articles as well as make the tagging for articles. In such a way users can find keywords easier for classification of the areas of concern. In this study, the keyword extraction methods are used to identify keyword models that are close to social tagging in order to give appropriate recommended taggings. In this study, if the same words appear in different articles more often, they will have higher correlation values in the articles they do not appear. Compare the keywords extracted by LSA, and those extracted by TF-IDF weighted method, the precision and recall values of LSA are better than those of TF-IDF so that LSA is more suitable to judge the keywords of artilcles. In this study, the precision and recall vlues obtained are up to a certain accuracy level so that it helps articles to establish close social homestay tags of the keywords. With the results, it shows that users can search information and references easier with the help of the methods applied in this study. 翁頌舜 2010 學位論文 ; thesis 91 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 輔仁大學 === 資訊管理學系 === 98 === In recent years, because of the number of small family increases, the travel trends have changed, and the government vigorously promotes the homestay industry, so that the number of homestay has increased year by year, but compared to the standard hotel-style services, homestay industry is approachable and has a friendly way to get along for ease of modern life so that people can pursue a higher degree of leisure life. As the shortage of information of accommodation sources, information is not unilaterally offered by the homestay industry but is dependent on tourists who wrote online articles to deliver the information. How to filter out the information to represent homestay industry would be a worthwhile research issues.
In this study, latent semantic analysis (LSA) and singular value decomposition (SVD) are used to extract keywords to classify the homestay articles as well as make the tagging for articles. In such a way users can find keywords easier for classification of the areas of concern. In this study, the keyword extraction methods are used to identify keyword models that are close to social tagging in order to give appropriate recommended taggings.
In this study, if the same words appear in different articles more often, they will have higher correlation values in the articles they do not appear. Compare the keywords extracted by LSA, and those extracted by TF-IDF weighted method, the precision and recall values of LSA are better than those of TF-IDF so that LSA is more suitable to judge the keywords of artilcles. In this study, the precision and recall vlues obtained are up to a certain accuracy level so that it helps articles to establish close social homestay tags of the keywords. With the results, it shows that users can search information and references easier with the help of the methods applied in this study.
|
author2 |
翁頌舜 |
author_facet |
翁頌舜 Ko,Wei-Ting 柯威廷 |
author |
Ko,Wei-Ting 柯威廷 |
spellingShingle |
Ko,Wei-Ting 柯威廷 Applying Text Mining Techniques to Extract Keywords of Homestay Articles |
author_sort |
Ko,Wei-Ting |
title |
Applying Text Mining Techniques to Extract Keywords of Homestay Articles |
title_short |
Applying Text Mining Techniques to Extract Keywords of Homestay Articles |
title_full |
Applying Text Mining Techniques to Extract Keywords of Homestay Articles |
title_fullStr |
Applying Text Mining Techniques to Extract Keywords of Homestay Articles |
title_full_unstemmed |
Applying Text Mining Techniques to Extract Keywords of Homestay Articles |
title_sort |
applying text mining techniques to extract keywords of homestay articles |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/70442823355506415150 |
work_keys_str_mv |
AT koweiting applyingtextminingtechniquestoextractkeywordsofhomestayarticles AT kēwēitíng applyingtextminingtechniquestoextractkeywordsofhomestayarticles AT koweiting lìyòngwénzìtànkānjìshùcuìqǔmínsùwénzhāngguānjiànzì AT kēwēitíng lìyòngwénzìtànkānjìshùcuìqǔmínsùwénzhāngguānjiànzì |
_version_ |
1718029684682260480 |