Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity
The using of Twitter by selebrities has become a new trend of impression management strategy. Mining public reaction in social media is a good strategy to obtain feedbacks, but extracting it are not trivial matter. Reads hundred of tweets while determine their sentiment polarity are time consuming....
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universitas Gadjah Mada
2016-07-01
|
Series: | IJCCS (Indonesian Journal of Computing and Cybernetics Systems) |
Subjects: | |
Online Access: | https://jurnal.ugm.ac.id/ijccs/article/view/16625 |
id |
doaj-9103fcd28df0428f95f58be5c1783aba |
---|---|
record_format |
Article |
spelling |
doaj-9103fcd28df0428f95f58be5c1783aba2020-11-24T20:55:03ZengUniversitas Gadjah MadaIJCCS (Indonesian Journal of Computing and Cybernetics Systems)1978-15202460-72582016-07-0110220721810.22146/ijccs.1662512159Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine SimilarityDevid Haryalesmana WahidAzhari SNThe using of Twitter by selebrities has become a new trend of impression management strategy. Mining public reaction in social media is a good strategy to obtain feedbacks, but extracting it are not trivial matter. Reads hundred of tweets while determine their sentiment polarity are time consuming. Extractive sentiment summarization machine are needed to address this issue. Previous research generally do not include sentiment information contained in a tweet as weight factor, as a results only general topics of discussion are extracted. This research aimed to do an extractive sentiment summarization on both positive and negative sentiment mentioning Indonesian selebrity, Agnes Monica, by combining SentiStrength, Hybrid TF-IDF, and Cosine Similarity. SentiStrength is used to obtain sentiment strength score and classify tweet as a positive, negative or neutral. The summarization of posisitve and negative sentiment can be done by rank tweets using Hybrid TF-IDF summarization and sentiment strength score as additional weight then removing similar tweet by using Cosine Similarity. The test results showed that the combination of SentiStrength, Hybrid TF-IDF, and Cosine Similarity perform better than using Hybrid TF-IDF only, given an average 60% accuracy and 62% f-measure. This is due to the addition of sentiment score as a weight factor in sentiment summarization.https://jurnal.ugm.ac.id/ijccs/article/view/16625extractive sentiment summarization, sentiment analysist, classification, automatic text summarization, SentiStrength, Hybrid TF-IDF |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Devid Haryalesmana Wahid Azhari SN |
spellingShingle |
Devid Haryalesmana Wahid Azhari SN Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity IJCCS (Indonesian Journal of Computing and Cybernetics Systems) extractive sentiment summarization, sentiment analysist, classification, automatic text summarization, SentiStrength, Hybrid TF-IDF |
author_facet |
Devid Haryalesmana Wahid Azhari SN |
author_sort |
Devid Haryalesmana Wahid |
title |
Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity |
title_short |
Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity |
title_full |
Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity |
title_fullStr |
Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity |
title_full_unstemmed |
Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity |
title_sort |
peringkasan sentimen esktraktif di twitter menggunakan hybrid tf-idf dan cosine similarity |
publisher |
Universitas Gadjah Mada |
series |
IJCCS (Indonesian Journal of Computing and Cybernetics Systems) |
issn |
1978-1520 2460-7258 |
publishDate |
2016-07-01 |
description |
The using of Twitter by selebrities has become a new trend of impression management strategy. Mining public reaction in social media is a good strategy to obtain feedbacks, but extracting it are not trivial matter. Reads hundred of tweets while determine their sentiment polarity are time consuming. Extractive sentiment summarization machine are needed to address this issue. Previous research generally do not include sentiment information contained in a tweet as weight factor, as a results only general topics of discussion are extracted.
This research aimed to do an extractive sentiment summarization on both positive and negative sentiment mentioning Indonesian selebrity, Agnes Monica, by combining SentiStrength, Hybrid TF-IDF, and Cosine Similarity. SentiStrength is used to obtain sentiment strength score and classify tweet as a positive, negative or neutral. The summarization of posisitve and negative sentiment can be done by rank tweets using Hybrid TF-IDF summarization and sentiment strength score as additional weight then removing similar tweet by using Cosine Similarity.
The test results showed that the combination of SentiStrength, Hybrid TF-IDF, and Cosine Similarity perform better than using Hybrid TF-IDF only, given an average 60% accuracy and 62% f-measure. This is due to the addition of sentiment score as a weight factor in sentiment summarization. |
topic |
extractive sentiment summarization, sentiment analysist, classification, automatic text summarization, SentiStrength, Hybrid TF-IDF |
url |
https://jurnal.ugm.ac.id/ijccs/article/view/16625 |
work_keys_str_mv |
AT devidharyalesmanawahid peringkasansentimenesktraktifditwittermenggunakanhybridtfidfdancosinesimilarity AT azharisn peringkasansentimenesktraktifditwittermenggunakanhybridtfidfdancosinesimilarity |
_version_ |
1716792771010887680 |