An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews
How to acquire useful information intelligently in the age of information explosion has become an important issue. In this context, sentiment analysis emerges with the growth of the need of information extraction. One of the most important tasks of sentiment analysis is feature extraction of entitie...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-05-01
|
Series: | Sustainability |
Subjects: | |
Online Access: | http://www.mdpi.com/2071-1050/10/5/1425 |
id |
doaj-1b304d055ea949abb552710cea56b08f |
---|---|
record_format |
Article |
spelling |
doaj-1b304d055ea949abb552710cea56b08f2020-11-24T20:59:05ZengMDPI AGSustainability2071-10502018-05-01105142510.3390/su10051425su10051425An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer ReviewsChen Liu0Li Tang1Wei Shan2Business School, University of Shanghai for Science and Technology, Shanghai 200093, ChinaBusiness School, University of Shanghai for Science and Technology, Shanghai 200093, ChinaSchool of Economics and Management, Beihang University, Beijing 100191, ChinaHow to acquire useful information intelligently in the age of information explosion has become an important issue. In this context, sentiment analysis emerges with the growth of the need of information extraction. One of the most important tasks of sentiment analysis is feature extraction of entities in consumer reviews. This paper first constitutes a directed bipartite feature-sentiment relation network with a set of candidate features-sentiment pairs that is extracted by dependency syntax analysis from consumer reviews. Then, a novel method called MHITS which combines PMI with weighted HITS algorithm is proposed to rank these candidate product features to find out real product features. Empirical experiments indicate the effectiveness of our approach across different kinds and various data sizes of product. In addition, the effect of the proposed algorithm is not the same for the corpus with different proportions of the word pair that includes the “bad”, “good”, “poor”, “pretty good”, “not bad” these general collocation words.http://www.mdpi.com/2071-1050/10/5/1425opinion miningfeature extractionbipartite networkextended HITS algorithm |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chen Liu Li Tang Wei Shan |
spellingShingle |
Chen Liu Li Tang Wei Shan An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews Sustainability opinion mining feature extraction bipartite network extended HITS algorithm |
author_facet |
Chen Liu Li Tang Wei Shan |
author_sort |
Chen Liu |
title |
An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews |
title_short |
An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews |
title_full |
An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews |
title_fullStr |
An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews |
title_full_unstemmed |
An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews |
title_sort |
extended hits algorithm on bipartite network for features extraction of online customer reviews |
publisher |
MDPI AG |
series |
Sustainability |
issn |
2071-1050 |
publishDate |
2018-05-01 |
description |
How to acquire useful information intelligently in the age of information explosion has become an important issue. In this context, sentiment analysis emerges with the growth of the need of information extraction. One of the most important tasks of sentiment analysis is feature extraction of entities in consumer reviews. This paper first constitutes a directed bipartite feature-sentiment relation network with a set of candidate features-sentiment pairs that is extracted by dependency syntax analysis from consumer reviews. Then, a novel method called MHITS which combines PMI with weighted HITS algorithm is proposed to rank these candidate product features to find out real product features. Empirical experiments indicate the effectiveness of our approach across different kinds and various data sizes of product. In addition, the effect of the proposed algorithm is not the same for the corpus with different proportions of the word pair that includes the “bad”, “good”, “poor”, “pretty good”, “not bad” these general collocation words. |
topic |
opinion mining feature extraction bipartite network extended HITS algorithm |
url |
http://www.mdpi.com/2071-1050/10/5/1425 |
work_keys_str_mv |
AT chenliu anextendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews AT litang anextendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews AT weishan anextendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews AT chenliu extendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews AT litang extendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews AT weishan extendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews |
_version_ |
1716783804661628928 |