Applying Feature Opinion Pairs in Product Review Classification
碩士 === 雲林科技大學 === 資訊管理系碩士班 === 98 === With the rapid development of Internet, the rise of electronic commerce, and the extensive use of web2.0 technology, more and more people expresses their views of products and services on the Internet. However the Web''s comments are usually mi...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/94233843351678785540 |
id |
ndltd-TW-098YUNT5396035 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098YUNT53960352015-10-13T18:58:56Z http://ndltd.ncl.edu.tw/handle/94233843351678785540 Applying Feature Opinion Pairs in Product Review Classification 利用特徵意見配對擷取法應用在產品評論分類 Po-shun Liao 廖柏舜 碩士 雲林科技大學 資訊管理系碩士班 98 With the rapid development of Internet, the rise of electronic commerce, and the extensive use of web2.0 technology, more and more people expresses their views of products and services on the Internet. However the Web''s comments are usually mixed with positive and negative comments, if processing manually to obtain information which is useful, it must spend a lot of energy and time. In text mining, document clustering can classify files to solve the current issue of disordering file on the Internet. When the use of text mining, the document will go through pre-processing of documents into a number of keywords to represent documents, and then converted into numerical values. General approach is to use vector space model, converted into vector according to each of the keywords, then use the clustering algorithm to cluster documents into categories. However, the use of vector space model, it is usually accompanied by excessive noise, making the effectiveness document clustering affected. In this study, feature representation method was improved by using feature-opinion pairs to represent the features of vector space model, so the feature representation method can contain more semantic information, and then generate the vector space model into classification to classify product reviews. The experiment results shows that the proposed feature representation method was successfully increase the five-way classification accuracy, in addition to the Average accuracy of setting root as Token feature for the extraction rule has a significant improvement, the accuracy in each star rating have a slightly increased. This study will be effectively add text polarity into the feature which makes the data distribution in feature space more clearly none 施學琦 2010 學位論文 ; thesis 58 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 雲林科技大學 === 資訊管理系碩士班 === 98 === With the rapid development of Internet, the rise of electronic commerce, and the extensive use of web2.0 technology, more and more people expresses their views of products and services on the Internet. However the Web''s comments are usually mixed with positive and negative comments, if processing manually to obtain information which is useful, it must spend a lot of energy and time. In text mining, document clustering can classify files to solve the current issue of disordering file on the Internet. When the use of text mining, the document will go through pre-processing of documents into a number of keywords to represent documents, and then converted into numerical values. General approach is to use vector space model, converted into vector according to each of the keywords, then use the clustering algorithm to cluster documents into categories. However, the use of vector space model, it is usually accompanied by excessive noise, making the effectiveness document clustering affected. In this study, feature representation method was improved by using feature-opinion pairs to represent the features of vector space model, so the feature representation method can contain more semantic information, and then generate the vector space model into classification to classify product reviews. The experiment results shows that the proposed feature representation method was successfully increase the five-way classification accuracy, in addition to the Average accuracy of setting root as Token feature for the extraction rule has a significant improvement, the accuracy in each star rating have a slightly increased. This study will be effectively add text polarity into the feature which makes the data distribution in feature space more clearly
|
author2 |
none |
author_facet |
none Po-shun Liao 廖柏舜 |
author |
Po-shun Liao 廖柏舜 |
spellingShingle |
Po-shun Liao 廖柏舜 Applying Feature Opinion Pairs in Product Review Classification |
author_sort |
Po-shun Liao |
title |
Applying Feature Opinion Pairs in Product Review Classification |
title_short |
Applying Feature Opinion Pairs in Product Review Classification |
title_full |
Applying Feature Opinion Pairs in Product Review Classification |
title_fullStr |
Applying Feature Opinion Pairs in Product Review Classification |
title_full_unstemmed |
Applying Feature Opinion Pairs in Product Review Classification |
title_sort |
applying feature opinion pairs in product review classification |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/94233843351678785540 |
work_keys_str_mv |
AT poshunliao applyingfeatureopinionpairsinproductreviewclassification AT liàobǎishùn applyingfeatureopinionpairsinproductreviewclassification AT poshunliao lìyòngtèzhēngyìjiànpèiduìxiéqǔfǎyīngyòngzàichǎnpǐnpínglùnfēnlèi AT liàobǎishùn lìyòngtèzhēngyìjiànpèiduìxiéqǔfǎyīngyòngzàichǎnpǐnpínglùnfēnlèi |
_version_ |
1718039675063500800 |