Classification and Visualisation of Twitter Sentiment Data
The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking adv...
Main Authors: | , |
---|---|
Format: | Others |
Language: | English |
Published: |
Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap
2013
|
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967 |
id |
ndltd-UPSALLA1-oai-DiVA.org-ntnu-22967 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-ntnu-229672013-10-13T04:36:54ZClassification and Visualisation of Twitter Sentiment DataengSelmer, OyvindBrevik, MikaelNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapInstitutt for datateknikk og informasjonsvitenskap2013The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking advantage of its informal nature and doing opinion mining and sentiment analysis. This master thesis describes a system for doing Sentiment Analysis on Twitter data and experiments with grid searches on various combinations of machine learning algorithms, features and preprocessing methods to achieve so. The classification system is fairly domain independent and performs better than baseline. This system is designed to be fast enough to classify big amounts of data and tweets in a stream, and provides an application program interface (API) to easily transfer data to applications or end users. Three visualisation applications are implemented, showing how to use the API and providing examples of how sentiment data can be used.The main contributions are: C1: A literary study of the state-of-the-art for Twitter Sentiment Analysis.C2: The implementation of a general system architecture for doing Twitter Sentiment Analysis. C3: A comparison of different machine learning algorithms for the task of identifying sentiments in short messages in a fairly semi-independent domain.C4: Implementations of a set of visualisation applications, showing how to use data from the generic system and providing examples of how to present sentiment analysis data. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967Local ntnudaim:9708application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
description |
The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking advantage of its informal nature and doing opinion mining and sentiment analysis. This master thesis describes a system for doing Sentiment Analysis on Twitter data and experiments with grid searches on various combinations of machine learning algorithms, features and preprocessing methods to achieve so. The classification system is fairly domain independent and performs better than baseline. This system is designed to be fast enough to classify big amounts of data and tweets in a stream, and provides an application program interface (API) to easily transfer data to applications or end users. Three visualisation applications are implemented, showing how to use the API and providing examples of how sentiment data can be used.The main contributions are: C1: A literary study of the state-of-the-art for Twitter Sentiment Analysis.C2: The implementation of a general system architecture for doing Twitter Sentiment Analysis. C3: A comparison of different machine learning algorithms for the task of identifying sentiments in short messages in a fairly semi-independent domain.C4: Implementations of a set of visualisation applications, showing how to use data from the generic system and providing examples of how to present sentiment analysis data. |
author |
Selmer, Oyvind Brevik, Mikael |
spellingShingle |
Selmer, Oyvind Brevik, Mikael Classification and Visualisation of Twitter Sentiment Data |
author_facet |
Selmer, Oyvind Brevik, Mikael |
author_sort |
Selmer, Oyvind |
title |
Classification and Visualisation of Twitter Sentiment Data |
title_short |
Classification and Visualisation of Twitter Sentiment Data |
title_full |
Classification and Visualisation of Twitter Sentiment Data |
title_fullStr |
Classification and Visualisation of Twitter Sentiment Data |
title_full_unstemmed |
Classification and Visualisation of Twitter Sentiment Data |
title_sort |
classification and visualisation of twitter sentiment data |
publisher |
Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap |
publishDate |
2013 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967 |
work_keys_str_mv |
AT selmeroyvind classificationandvisualisationoftwittersentimentdata AT brevikmikael classificationandvisualisationoftwittersentimentdata |
_version_ |
1716604526700527616 |