On social interaction metrics : social network crawling based on interestingness

With the high use of online social networks we are entering the era of big data. With limited resources it is important to evaluate and prioritize interesting data. This thesis addresses the following aspects of social network analysis: efficient data collection, social interaction evaluation and us...


Bibliographic Details
Main Author: Erlandsson, Fredrik
Format: Others
Language: English
Published: Blekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik 2014
Subjects: Computer Sciences; Datavetenskap (datalogi); Media and Communication Technology; Medieteknik
Online Access: http://urn.kb.se/resolve?urn=urn:nbn:se:bth-00596
http://nbn-resolving.de/urn:isbn:978-91-7295-287-4
id ndltd-UPSALLA1-oai-DiVA.org-bth-00596
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-bth-00596 2018-05-24T05:26:21Z
Karlskrona : Blekinge Institute of Technology, 2014
Licentiate thesis, comprehensive summary
info:eu-repo/semantics/masterThesis
text
Local oai:bth.se:forskinfo0BC502A96D245C16C1257D32002E2B6C
Blekinge Institute of Technology Licentiate Dissertation Series, 1650-2140 ; 6
application/pdf
info:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Computer Sciences
Datavetenskap (datalogi)
Media and Communication Technology
Medieteknik
description With the high use of online social networks we are entering the era of big data. With limited resources it is important to evaluate and prioritize interesting data. This thesis addresses the following aspects of social network analysis: efficient data collection, social interaction evaluation and user privacy concerns. It is possible to collect data from online social networks via their open APIs. However, systematic and efficient collection of online social network data is still challenging. To improve the quality of the data collection process, prioritizing methods are statistically evaluated. Results suggest that the collection time can be reduced by up to 48% by prioritizing the collection of posts. Evaluation of social interactions also requires data that covers all the interactions in a given domain. This has previously been hard to do, but the proposed crawler is capable of extracting all social interactions from a given page. With the extracted data it is, for instance, possible to illustrate indirect interactions between different users who do not necessarily have to be connected. Methods using the same data to identify and cluster different opinions in online communities have been developed. These methods are evaluated with the tool Linguistic Inquiry and Word Count. The privacy of the content produced, and of the users’ private information provided on social networks, is important to protect. Users must be aware of the consequences of posting in online social networks in terms of privacy. Methods to protect user privacy are presented. The proposed crawler in this thesis has, over a period of 20 months, collected over 38 million posts from public pages on Facebook, covering 4 billion likes and 340 million comments from over 280 million users. The performed data collection yielded one of the largest research datasets of social interactions on Facebook today, enabling qualitative research in the form of social network analysis.
author Erlandsson, Fredrik
title On social interaction metrics : social network crawling based on interestingness
publisher Blekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik
publishDate 2014
url http://urn.kb.se/resolve?urn=urn:nbn:se:bth-00596
http://nbn-resolving.de/urn:isbn:978-91-7295-287-4