Classification and Visualisation of Twitter Sentiment Data

The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking adv...

Full description

Bibliographic Details
Main Authors: Selmer, Oyvind, Brevik, Mikael
Format: Others
Language:English
Published: Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap 2013
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967
id ndltd-UPSALLA1-oai-DiVA.org-ntnu-22967
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-ntnu-229672013-10-13T04:36:54ZClassification and Visualisation of Twitter Sentiment DataengSelmer, OyvindBrevik, MikaelNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapInstitutt for datateknikk og informasjonsvitenskap2013The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking advantage of its informal nature and doing opinion mining and sentiment analysis. This master thesis describes a system for doing Sentiment Analysis on Twitter data and experiments with grid searches on various combinations of machine learning algorithms, features and preprocessing methods to achieve so. The classification system is fairly domain independent and performs better than baseline. This system is designed to be fast enough to classify big amounts of data and tweets in a stream, and provides an application program interface (API) to easily transfer data to applications or end users. Three visualisation applications are implemented, showing how to use the API and providing examples of how sentiment data can be used.The main contributions are: C1: A literary study of the state-of-the-art for Twitter Sentiment Analysis.C2: The implementation of a general system architecture for doing Twitter Sentiment Analysis. C3: A comparison of different machine learning algorithms for the task of identifying sentiments in short messages in a fairly semi-independent domain.C4: Implementations of a set of visualisation applications, showing how to use data from the generic system and providing examples of how to present sentiment analysis data. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967Local ntnudaim:9708application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
description The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking advantage of its informal nature and doing opinion mining and sentiment analysis. This master thesis describes a system for doing Sentiment Analysis on Twitter data and experiments with grid searches on various combinations of machine learning algorithms, features and preprocessing methods to achieve so. The classification system is fairly domain independent and performs better than baseline. This system is designed to be fast enough to classify big amounts of data and tweets in a stream, and provides an application program interface (API) to easily transfer data to applications or end users. Three visualisation applications are implemented, showing how to use the API and providing examples of how sentiment data can be used.The main contributions are: C1: A literary study of the state-of-the-art for Twitter Sentiment Analysis.C2: The implementation of a general system architecture for doing Twitter Sentiment Analysis. C3: A comparison of different machine learning algorithms for the task of identifying sentiments in short messages in a fairly semi-independent domain.C4: Implementations of a set of visualisation applications, showing how to use data from the generic system and providing examples of how to present sentiment analysis data.
author Selmer, Oyvind
Brevik, Mikael
spellingShingle Selmer, Oyvind
Brevik, Mikael
Classification and Visualisation of Twitter Sentiment Data
author_facet Selmer, Oyvind
Brevik, Mikael
author_sort Selmer, Oyvind
title Classification and Visualisation of Twitter Sentiment Data
title_short Classification and Visualisation of Twitter Sentiment Data
title_full Classification and Visualisation of Twitter Sentiment Data
title_fullStr Classification and Visualisation of Twitter Sentiment Data
title_full_unstemmed Classification and Visualisation of Twitter Sentiment Data
title_sort classification and visualisation of twitter sentiment data
publisher Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap
publishDate 2013
url http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967
work_keys_str_mv AT selmeroyvind classificationandvisualisationoftwittersentimentdata
AT brevikmikael classificationandvisualisationoftwittersentimentdata
_version_ 1716604526700527616