Applying Feature Opinion Pairs in Product Review Classification

碩士 === 雲林科技大學 === 資訊管理系碩士班 === 98 === With the rapid development of Internet, the rise of electronic commerce, and the extensive use of web2.0 technology, more and more people expresses their views of products and services on the Internet. However the Web''s comments are usually mi...

Full description

Bibliographic Details
Main Authors: Po-shun Liao, 廖柏舜
Other Authors: none
Format: Others
Language:zh-TW
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/94233843351678785540
id ndltd-TW-098YUNT5396035
record_format oai_dc
spelling ndltd-TW-098YUNT53960352015-10-13T18:58:56Z http://ndltd.ncl.edu.tw/handle/94233843351678785540 Applying Feature Opinion Pairs in Product Review Classification 利用特徵意見配對擷取法應用在產品評論分類 Po-shun Liao 廖柏舜 碩士 雲林科技大學 資訊管理系碩士班 98 With the rapid development of Internet, the rise of electronic commerce, and the extensive use of web2.0 technology, more and more people expresses their views of products and services on the Internet. However the Web''s comments are usually mixed with positive and negative comments, if processing manually to obtain information which is useful, it must spend a lot of energy and time. In text mining, document clustering can classify files to solve the current issue of disordering file on the Internet. When the use of text mining, the document will go through pre-processing of documents into a number of keywords to represent documents, and then converted into numerical values. General approach is to use vector space model, converted into vector according to each of the keywords, then use the clustering algorithm to cluster documents into categories. However, the use of vector space model, it is usually accompanied by excessive noise, making the effectiveness document clustering affected. In this study, feature representation method was improved by using feature-opinion pairs to represent the features of vector space model, so the feature representation method can contain more semantic information, and then generate the vector space model into classification to classify product reviews. The experiment results shows that the proposed feature representation method was successfully increase the five-way classification accuracy, in addition to the Average accuracy of setting root as Token feature for the extraction rule has a significant improvement, the accuracy in each star rating have a slightly increased. This study will be effectively add text polarity into the feature which makes the data distribution in feature space more clearly none 施學琦 2010 學位論文 ; thesis 58 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 雲林科技大學 === 資訊管理系碩士班 === 98 === With the rapid development of Internet, the rise of electronic commerce, and the extensive use of web2.0 technology, more and more people expresses their views of products and services on the Internet. However the Web''s comments are usually mixed with positive and negative comments, if processing manually to obtain information which is useful, it must spend a lot of energy and time. In text mining, document clustering can classify files to solve the current issue of disordering file on the Internet. When the use of text mining, the document will go through pre-processing of documents into a number of keywords to represent documents, and then converted into numerical values. General approach is to use vector space model, converted into vector according to each of the keywords, then use the clustering algorithm to cluster documents into categories. However, the use of vector space model, it is usually accompanied by excessive noise, making the effectiveness document clustering affected. In this study, feature representation method was improved by using feature-opinion pairs to represent the features of vector space model, so the feature representation method can contain more semantic information, and then generate the vector space model into classification to classify product reviews. The experiment results shows that the proposed feature representation method was successfully increase the five-way classification accuracy, in addition to the Average accuracy of setting root as Token feature for the extraction rule has a significant improvement, the accuracy in each star rating have a slightly increased. This study will be effectively add text polarity into the feature which makes the data distribution in feature space more clearly
author2 none
author_facet none
Po-shun Liao
廖柏舜
author Po-shun Liao
廖柏舜
spellingShingle Po-shun Liao
廖柏舜
Applying Feature Opinion Pairs in Product Review Classification
author_sort Po-shun Liao
title Applying Feature Opinion Pairs in Product Review Classification
title_short Applying Feature Opinion Pairs in Product Review Classification
title_full Applying Feature Opinion Pairs in Product Review Classification
title_fullStr Applying Feature Opinion Pairs in Product Review Classification
title_full_unstemmed Applying Feature Opinion Pairs in Product Review Classification
title_sort applying feature opinion pairs in product review classification
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/94233843351678785540
work_keys_str_mv AT poshunliao applyingfeatureopinionpairsinproductreviewclassification
AT liàobǎishùn applyingfeatureopinionpairsinproductreviewclassification
AT poshunliao lìyòngtèzhēngyìjiànpèiduìxiéqǔfǎyīngyòngzàichǎnpǐnpínglùnfēnlèi
AT liàobǎishùn lìyòngtèzhēngyìjiànpèiduìxiéqǔfǎyīngyòngzàichǎnpǐnpínglùnfēnlèi
_version_ 1718039675063500800