Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
Fake reviews can mislead consumers, and in large numbers they can cause huge property losses and public opinion crises. Therefore, it is necessary to detect and filter fake reviews. However, most existing methods detect fake reviews with low accuracy because they use only single fe...
Main Authors: | Jingdong Wang, Haitao Kan, Fanqi Meng, Qizi Mu, Genhua Shi, Xixi Xiao |
---|---|
Format: | Article |
Language: | English |
Published: | IEEE 2020-01-01 |
Series: | IEEE Access |
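Subjects: | Fake review detection; machine learning; multiple feature fusion; feature extraction; rolling collaborative training |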
Online Access: | https://ieeexplore.ieee.org/document/9212374/ |
id |
doaj-7ee943acb3d640dcb5caab8246cdcb5d |
---|---|
record_format |
Article |
spelling |
doaj-7ee943acb3d640dcb5caab8246cdcb5d; 2021-03-30T04:23:14Z; eng; IEEE; IEEE Access; 2169-3536; 2020-01-01; 8; 182625; 182639; 10.1109/ACCESS.2020.3028588; 9212374; Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training; Jingdong Wang (https://orcid.org/0000-0001-7037-951X), Haitao Kan (https://orcid.org/0000-0002-6493-4668), Fanqi Meng, Qizi Mu (https://orcid.org/0000-0001-7079-4182), Genhua Shi, Xixi Xiao (https://orcid.org/0000-0002-1317-7820); all six authors: School of Computer Science, Northeast Electric Power University, Jilin City, China; https://ieeexplore.ieee.org/document/9212374/; Fake review detection; machine learning; multiple feature fusion; feature extraction; rolling collaborative training |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jingdong Wang Haitao Kan Fanqi Meng Qizi Mu Genhua Shi Xixi Xiao |
author_sort |
Jingdong Wang |
title |
Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Fake reviews can mislead consumers, and in large numbers they can cause huge property losses and public opinion crises, so detecting and filtering them is necessary. However, most existing methods detect fake reviews with low accuracy because they rely on single features and lack labeled experimental data. To solve this problem, we propose a novel method for detecting fake reviews based on multiple feature fusion and rolling collaborative training. First, the method requires an initial index system with multiple features, such as text features, sentiment features of reviews, and behavior features of reviewers. Second, the method needs an initial training sample set, so we designed algorithms to extract all the features of a review; the class of each review is then labeled manually. Finally, the method uses the initial sample set to train 7 classifiers and selects the most accurate one to classify new reviews. The novelty of the method is that the features and classification labels of the new reviews are added to the initial sample set as new samples, so the sample set grows automatically. Experimental results on reviews from the Yelp shopping website show that the proposed method detects fake reviews with an accuracy of 84.45%, which is 3.5% higher than the baseline methods. Compared with the latest deep learning model, its precision is 5.3% higher. According to the Friedman test, the support vector machine (SVM) and random forest (RF) classifiers are statistically the best performers. These results indicate that our method, which uses multiple features, is more accurate than the baseline models, and that it also addresses the shortage of labeled training samples in fake review detection. |
topic |
Fake review detection machine learning multiple feature fusion feature extraction rolling collaborative training |
url |
https://ieeexplore.ieee.org/document/9212374/ |
_version_ |
1724181889071185920 |
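
The rolling training loop described in the abstract (train several classifiers on the current sample set, keep the most accurate one, classify new reviews, and add those reviews with their predicted labels back into the sample set) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the three toy classifiers stand in for the paper's 7 classifiers, the three-number feature vectors are hypothetical placeholders for fused text, sentiment, and reviewer-behavior features, and the held-out validation set and the names `rolling_training`, `accuracy`, and the classifier classes are assumptions introduced here.

```python
from math import dist

# Each sample is (feature_vector, label); label 1 = fake, 0 = genuine.
# The three features are hypothetical stand-ins for fused text,
# sentiment, and reviewer-behavior scores.

class MajorityClass:
    """Toy baseline: always predicts the most common training label."""
    def fit(self, samples):
        labels = [y for _, y in samples]
        self.label = max(set(labels), key=labels.count)
        return self

    def predict(self, x):
        return self.label


class NearestCentroid:
    """Toy classifier: predicts the class whose mean vector is closest."""
    def fit(self, samples):
        self.centroids = {}
        for label in {y for _, y in samples}:
            rows = [x for x, y in samples if y == label]
            self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def predict(self, x):
        return min(self.centroids, key=lambda c: dist(x, self.centroids[c]))


class OneNearestNeighbor:
    """Toy classifier: copies the label of the closest training sample."""
    def fit(self, samples):
        self.samples = list(samples)
        return self

    def predict(self, x):
        return min(self.samples, key=lambda s: dist(x, s[0]))[1]


def accuracy(model, samples):
    """Fraction of samples whose predicted label matches the given label."""
    return sum(model.predict(x) == y for x, y in samples) / len(samples)


def rolling_training(labeled, validation, unlabeled_batches, candidates):
    """Grow the sample set by labeling each new batch with the best model."""
    sample_set = list(labeled)
    for batch in unlabeled_batches:
        # Retrain every candidate on the current (possibly grown) sample set.
        models = [cls().fit(sample_set) for cls in candidates]
        # Keep the candidate with the highest validation accuracy.
        best = max(models, key=lambda m: accuracy(m, validation))
        # Add the new reviews with their predicted labels as new samples.
        sample_set.extend([(x, best.predict(x)) for x in batch])
    return sample_set


labeled = [([0.9, 0.8, 0.9], 1), ([0.8, 0.9, 0.7], 1),
           ([0.1, 0.2, 0.1], 0), ([0.2, 0.1, 0.3], 0)]
validation = [([0.85, 0.7, 0.8], 1), ([0.15, 0.3, 0.2], 0)]
batches = [[[0.95, 0.9, 0.85], [0.05, 0.1, 0.2]], [[0.7, 0.8, 0.9]]]

grown = rolling_training(labeled, validation, batches,
                         [MajorityClass, NearestCentroid, OneNearestNeighbor])
print(len(grown))
```

In this sketch the sample set grows from 4 manually labeled reviews to 7 after two batches. Because the added labels are predictions rather than manual annotations, any misclassification is carried into later rounds; how the paper controls for that is not stated in this record.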