Applying Text Mining Techniques to Extract Keywords of Homestay Articles

Master's === Fu Jen Catholic University === Department of Information Management === 98 === In recent years, the number of homestays has increased year by year, because small families have become more common, travel trends have changed, and the government has vigorously promoted the homestay industry. Compared with standard hotel-style servic...


Bibliographic Details
Main Authors: Ko,Wei-Ting, 柯威廷
Other Authors: 翁頌舜
Format: Others
Language: zh-TW
Published: 2010
Online Access: http://ndltd.ncl.edu.tw/handle/70442823355506415150
id ndltd-TW-098FJU00396041
record_format oai_dc
spelling ndltd-TW-098FJU00396041 2015-10-13T18:16:46Z http://ndltd.ncl.edu.tw/handle/70442823355506415150 Applying Text Mining Techniques to Extract Keywords of Homestay Articles 利用文字探勘技術萃取民宿文章關鍵字 Ko,Wei-Ting 柯威廷 Master's Fu Jen Catholic University Department of Information Management 98 翁頌舜 2010 Degree thesis ; thesis 91 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === Fu Jen Catholic University === Department of Information Management === 98 === In recent years, the number of homestays has increased year by year, because small families have become more common, travel trends have changed, and the government has vigorously promoted the homestay industry. Compared with standard hotel-style services, homestays are approachable and offer a friendly, relaxed atmosphere that suits modern people pursuing a higher quality of leisure life. Because accommodation information is scarce, it is not supplied by homestay operators alone but also depends on tourists who write online articles. How to filter this information so that it represents the homestay industry is a worthwhile research issue. In this study, latent semantic analysis (LSA) and singular value decomposition (SVD) are used to extract keywords in order to classify homestay articles and to tag them, so that users can more easily find keywords for the areas they care about. The keyword extraction methods are used to identify keyword models that are close to social tagging, in order to recommend appropriate tags. In this study, words that appear more often across different articles receive higher correlation values in the articles in which they do not appear. When the keywords extracted by LSA are compared with those extracted by the TF-IDF weighting method, the precision and recall of LSA are better than those of TF-IDF, so LSA is more suitable for identifying the keywords of articles. The precision and recall values obtained in this study reach a certain level of accuracy, which helps establish keyword tags for articles that are close to social homestay tags. The results show that users can search for information and references more easily with the help of the methods applied in this study.
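The comparison described in the abstract can be illustrated with a small sketch. This is not the thesis's implementation: the toy English documents, the function names (`tfidf_matrix`, `low_rank`, `top_keywords`, `precision_recall`), the smoothed IDF variant, and the use of power iteration in place of a library SVD routine are all assumptions made for illustration. Real homestay articles in Chinese would also need word segmentation first. The sketch shows how a low-rank (LSA-style) reconstruction of a TF-IDF matrix can assign a non-zero weight to a word in an article where it does not appear, and how extracted keywords can be scored against reference tags.

```python
import math

def tfidf_matrix(docs):
    """Return (vocab, rows); rows[i][j] is the TF-IDF weight of term i in doc j."""
    vocab = sorted({w for d in docs for w in d})
    n = len(docs)
    rows = []
    for term in vocab:
        df = sum(1 for d in docs if term in d)
        idf = math.log(n / df) + 1.0  # assumed smoothed IDF variant
        rows.append([d.count(term) / len(d) * idf for d in docs])
    return vocab, rows

def top_singular_triplet(a, iters=300):
    """Power iteration on A^T A; returns (sigma, u, v) for the largest singular value."""
    m, n = len(a), len(a[0])
    v = [1.0 / (j + 1) for j in range(n)]  # arbitrary positive start vector
    for _ in range(iters):
        u = [sum(a[i][j] * v[j] for j in range(n)) for i in range(m)]  # u = A v
        w = [sum(a[i][j] * u[i] for i in range(m)) for j in range(n)]  # w = A^T u
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            return 0.0, [0.0] * m, v
        v = [x / norm for x in w]
    u = [sum(a[i][j] * v[j] for j in range(n)) for i in range(m)]
    sigma = math.sqrt(sum(x * x for x in u))
    if sigma == 0.0:
        return 0.0, u, v
    return sigma, [x / sigma for x in u], v

def low_rank(a, k):
    """Rank-k approximation of a, built one singular triplet at a time by deflation."""
    m, n = len(a), len(a[0])
    residual = [row[:] for row in a]
    approx = [[0.0] * n for _ in range(m)]
    for _ in range(k):
        sigma, u, v = top_singular_triplet(residual)
        if sigma < 1e-12:
            break
        for i in range(m):
            for j in range(n):
                term = sigma * u[i] * v[j]
                approx[i][j] += term
                residual[i][j] -= term
    return approx

def top_keywords(vocab, matrix, doc_index, n):
    """Return the n highest-weighted terms for one document (ties broken alphabetically)."""
    order = sorted(range(len(vocab)), key=lambda i: (-matrix[i][doc_index], vocab[i]))
    return [vocab[i] for i in order[:n]]

def precision_recall(extracted, reference_tags):
    """Precision and recall of extracted keywords against a set of reference tags."""
    hits = len(set(extracted) & set(reference_tags))
    precision = hits / len(extracted) if extracted else 0.0
    recall = hits / len(reference_tags) if reference_tags else 0.0
    return precision, recall

# Toy corpus (hypothetical): each document is a list of already-tokenized words.
docs = [
    ["homestay", "breakfast", "garden"],
    ["homestay", "breakfast", "sea"],
    ["homestay", "garden", "mountain"],
]
vocab, tfidf = tfidf_matrix(docs)
lsa = low_rank(tfidf, k=1)
sea = vocab.index("sea")
print("TF-IDF weight of 'sea' in doc 0:", tfidf[sea][0])
print("LSA weight of 'sea' in doc 0:", lsa[sea][0])
print("TF-IDF keywords for doc 0:", top_keywords(vocab, tfidf, 0, 2))
print("LSA keywords for doc 0:", top_keywords(vocab, lsa, 0, 2))
```

In this sketch the rank is fixed at `k=1` only because the corpus is tiny; the rank used in the thesis is not stated in this record. Precision and recall would be computed by passing each method's extracted keywords and a set of human-assigned tags to `precision_recall`.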
author2 翁頌舜
author_facet 翁頌舜
Ko,Wei-Ting
柯威廷
author Ko,Wei-Ting
柯威廷
spellingShingle Ko,Wei-Ting
柯威廷
Applying Text Mining Techniques to Extract Keywords of Homestay Articles
author_sort Ko,Wei-Ting
title Applying Text Mining Techniques to Extract Keywords of Homestay Articles
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/70442823355506415150