An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews

How to acquire useful information intelligently in the age of information explosion has become an important issue. In this context, sentiment analysis emerges with the growth of the need of information extraction. One of the most important tasks of sentiment analysis is feature extraction of entitie...

Full description

Bibliographic Details
Main Authors: Chen Liu, Li Tang, Wei Shan
Format: Article
Language:English
Published: MDPI AG 2018-05-01
Series:Sustainability
Subjects:
Online Access:http://www.mdpi.com/2071-1050/10/5/1425
id doaj-1b304d055ea949abb552710cea56b08f
record_format Article
spelling doaj-1b304d055ea949abb552710cea56b08f2020-11-24T20:59:05ZengMDPI AGSustainability2071-10502018-05-01105142510.3390/su10051425su10051425An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer ReviewsChen Liu0Li Tang1Wei Shan2Business School, University of Shanghai for Science and Technology, Shanghai 200093, ChinaBusiness School, University of Shanghai for Science and Technology, Shanghai 200093, ChinaSchool of Economics and Management, Beihang University, Beijing 100191, ChinaHow to acquire useful information intelligently in the age of information explosion has become an important issue. In this context, sentiment analysis emerges with the growth of the need of information extraction. One of the most important tasks of sentiment analysis is feature extraction of entities in consumer reviews. This paper first constitutes a directed bipartite feature-sentiment relation network with a set of candidate features-sentiment pairs that is extracted by dependency syntax analysis from consumer reviews. Then, a novel method called MHITS which combines PMI with weighted HITS algorithm is proposed to rank these candidate product features to find out real product features. Empirical experiments indicate the effectiveness of our approach across different kinds and various data sizes of product. In addition, the effect of the proposed algorithm is not the same for the corpus with different proportions of the word pair that includes the “bad”, “good”, “poor”, “pretty good”, “not bad” these general collocation words.http://www.mdpi.com/2071-1050/10/5/1425opinion miningfeature extractionbipartite networkextended HITS algorithm
collection DOAJ
language English
format Article
sources DOAJ
author Chen Liu
Li Tang
Wei Shan
spellingShingle Chen Liu
Li Tang
Wei Shan
An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews
Sustainability
opinion mining
feature extraction
bipartite network
extended HITS algorithm
author_facet Chen Liu
Li Tang
Wei Shan
author_sort Chen Liu
title An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews
title_short An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews
title_full An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews
title_fullStr An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews
title_full_unstemmed An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews
title_sort extended hits algorithm on bipartite network for features extraction of online customer reviews
publisher MDPI AG
series Sustainability
issn 2071-1050
publishDate 2018-05-01
description How to acquire useful information intelligently in the age of information explosion has become an important issue. In this context, sentiment analysis emerges with the growth of the need of information extraction. One of the most important tasks of sentiment analysis is feature extraction of entities in consumer reviews. This paper first constitutes a directed bipartite feature-sentiment relation network with a set of candidate features-sentiment pairs that is extracted by dependency syntax analysis from consumer reviews. Then, a novel method called MHITS which combines PMI with weighted HITS algorithm is proposed to rank these candidate product features to find out real product features. Empirical experiments indicate the effectiveness of our approach across different kinds and various data sizes of product. In addition, the effect of the proposed algorithm is not the same for the corpus with different proportions of the word pair that includes the “bad”, “good”, “poor”, “pretty good”, “not bad” these general collocation words.
topic opinion mining
feature extraction
bipartite network
extended HITS algorithm
url http://www.mdpi.com/2071-1050/10/5/1425
work_keys_str_mv AT chenliu anextendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews
AT litang anextendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews
AT weishan anextendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews
AT chenliu extendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews
AT litang extendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews
AT weishan extendedhitsalgorithmonbipartitenetworkforfeaturesextractionofonlinecustomerreviews
_version_ 1716783804661628928