Classification and Visualisation of Twitter Sentiment Data

The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking adv...

Full description

Bibliographic Details
Main Authors:	Selmer, Oyvind, Brevik, Mikael
Format:	Others
Language:	English
Published:	Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap 2013
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967

id	ndltd-UPSALLA1-oai-DiVA.org-ntnu-22967
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-ntnu-229672013-10-13T04:36:54ZClassification and Visualisation of Twitter Sentiment DataengSelmer, OyvindBrevik, MikaelNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapNorges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskapInstitutt for datateknikk og informasjonsvitenskap2013The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking advantage of its informal nature and doing opinion mining and sentiment analysis. This master thesis describes a system for doing Sentiment Analysis on Twitter data and experiments with grid searches on various combinations of machine learning algorithms, features and preprocessing methods to achieve so. The classification system is fairly domain independent and performs better than baseline. This system is designed to be fast enough to classify big amounts of data and tweets in a stream, and provides an application program interface (API) to easily transfer data to applications or end users. Three visualisation applications are implemented, showing how to use the API and providing examples of how sentiment data can be used.The main contributions are: C1: A literary study of the state-of-the-art for Twitter Sentiment Analysis.C2: The implementation of a general system architecture for doing Twitter Sentiment Analysis. C3: A comparison of different machine learning algorithms for the task of identifying sentiments in short messages in a fairly semi-independent domain.C4: Implementations of a set of visualisation applications, showing how to use data from the generic system and providing examples of how to present sentiment analysis data. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967Local ntnudaim:9708application/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Others
sources	NDLTD
description	The social micro-blog site Twitter grows in user base each day and has become an attractive platform for companies, politicians, marketeers, and others wishing to share information and/or opinions. With a growing user market for Twitter, more and more systems and research are released for taking advantage of its informal nature and doing opinion mining and sentiment analysis. This master thesis describes a system for doing Sentiment Analysis on Twitter data and experiments with grid searches on various combinations of machine learning algorithms, features and preprocessing methods to achieve so. The classification system is fairly domain independent and performs better than baseline. This system is designed to be fast enough to classify big amounts of data and tweets in a stream, and provides an application program interface (API) to easily transfer data to applications or end users. Three visualisation applications are implemented, showing how to use the API and providing examples of how sentiment data can be used.The main contributions are: C1: A literary study of the state-of-the-art for Twitter Sentiment Analysis.C2: The implementation of a general system architecture for doing Twitter Sentiment Analysis. C3: A comparison of different machine learning algorithms for the task of identifying sentiments in short messages in a fairly semi-independent domain.C4: Implementations of a set of visualisation applications, showing how to use data from the generic system and providing examples of how to present sentiment analysis data.
author	Selmer, Oyvind Brevik, Mikael
spellingShingle	Selmer, Oyvind Brevik, Mikael Classification and Visualisation of Twitter Sentiment Data
author_facet	Selmer, Oyvind Brevik, Mikael
author_sort	Selmer, Oyvind
title	Classification and Visualisation of Twitter Sentiment Data
title_short	Classification and Visualisation of Twitter Sentiment Data
title_full	Classification and Visualisation of Twitter Sentiment Data
title_fullStr	Classification and Visualisation of Twitter Sentiment Data
title_full_unstemmed	Classification and Visualisation of Twitter Sentiment Data
title_sort	classification and visualisation of twitter sentiment data
publisher	Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap
publishDate	2013
url	http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-22967
work_keys_str_mv	AT selmeroyvind classificationandvisualisationoftwittersentimentdata AT brevikmikael classificationandvisualisationoftwittersentimentdata
_version_	1716604526700527616

Classification and Visualisation of Twitter Sentiment Data

Similar Items