Helpfulness Analysis for Movie Reviews

碩士 === 國立臺灣師範大學 === 資訊工程學系 === 106 === With the rapid development of the Internet, huge amount of information is an inevitable trend, which also includes lots of user comments. Many reviews do not include useful information, so extracting helpful comments from a large number of user reviews is the r...

Full description

Bibliographic Details
Main Authors: Hsu, Chih-Ting, 徐志廷
Other Authors: Hou, Wen-Chuan
Format: Others
Language:zh-TW
Published: 2018
Online Access:http://ndltd.ncl.edu.tw/handle/49p4wv
id ndltd-TW-106NTNU5392004
record_format oai_dc
spelling ndltd-TW-106NTNU53920042019-05-16T00:15:35Z http://ndltd.ncl.edu.tw/handle/49p4wv Helpfulness Analysis for Movie Reviews 電影評論之助益性分析研究 Hsu, Chih-Ting 徐志廷 碩士 國立臺灣師範大學 資訊工程學系 106 With the rapid development of the Internet, huge amount of information is an inevitable trend, which also includes lots of user comments. Many reviews do not include useful information, so extracting helpful comments from a large number of user reviews is the research goal of this paper. There is no standard definition of review helpfulness, and as long as if it helps users to think about it, it can be helpful. Therefore, this study attempts to give comments by the characteristics of scores , as a basis for judgment. This thesis takes the short stories of Yahoo movie as the research target.The study uses the CKIP (Chinese Knowledge Information Processing) to process the comments first, and then find out the TFIDF keywords, parts of speech and lengths of comments from the data. The TFIDF keyword are expanded to synonyms and antonyms by the online dictionary of Ministry of Education. NTUSD (National Taiwan University Semantic Dictionary) was used built by National Taiwan University to find out the sentiment words contained in each comment and to calculate the number of sentiment words. Using SVM training model and prediction results, the accuracy of 79.7% was obtained. Hou, Wen-Chuan 侯文娟 2018 學位論文 ; thesis 45 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣師範大學 === 資訊工程學系 === 106 === With the rapid development of the Internet, huge amount of information is an inevitable trend, which also includes lots of user comments. Many reviews do not include useful information, so extracting helpful comments from a large number of user reviews is the research goal of this paper. There is no standard definition of review helpfulness, and as long as if it helps users to think about it, it can be helpful. Therefore, this study attempts to give comments by the characteristics of scores , as a basis for judgment. This thesis takes the short stories of Yahoo movie as the research target.The study uses the CKIP (Chinese Knowledge Information Processing) to process the comments first, and then find out the TFIDF keywords, parts of speech and lengths of comments from the data. The TFIDF keyword are expanded to synonyms and antonyms by the online dictionary of Ministry of Education. NTUSD (National Taiwan University Semantic Dictionary) was used built by National Taiwan University to find out the sentiment words contained in each comment and to calculate the number of sentiment words. Using SVM training model and prediction results, the accuracy of 79.7% was obtained.
author2 Hou, Wen-Chuan
author_facet Hou, Wen-Chuan
Hsu, Chih-Ting
徐志廷
author Hsu, Chih-Ting
徐志廷
spellingShingle Hsu, Chih-Ting
徐志廷
Helpfulness Analysis for Movie Reviews
author_sort Hsu, Chih-Ting
title Helpfulness Analysis for Movie Reviews
title_short Helpfulness Analysis for Movie Reviews
title_full Helpfulness Analysis for Movie Reviews
title_fullStr Helpfulness Analysis for Movie Reviews
title_full_unstemmed Helpfulness Analysis for Movie Reviews
title_sort helpfulness analysis for movie reviews
publishDate 2018
url http://ndltd.ncl.edu.tw/handle/49p4wv
work_keys_str_mv AT hsuchihting helpfulnessanalysisformoviereviews
AT xúzhìtíng helpfulnessanalysisformoviereviews
AT hsuchihting diànyǐngpínglùnzhīzhùyìxìngfēnxīyánjiū
AT xúzhìtíng diànyǐngpínglùnzhīzhùyìxìngfēnxīyánjiū
_version_ 1719163403387273216