NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection

Collecting and analysing data of analytical value is important for many service providers today. Many make use of NoSQL databases for their larger software systems, what is less known is how to effectively analyse and gather business intelligence from the data in these systems. This paper suggests a...

Full description

Bibliographic Details
Main Author: Klingsbo, Lukas
Format: Others
Language:English
Published: Uppsala universitet, Institutionen för informationsteknologi 2015
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-260394
id ndltd-UPSALLA1-oai-DiVA.org-uu-260394
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-uu-2603942015-08-19T04:55:26ZNoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data CollectionengKlingsbo, LukasUppsala universitet, Institutionen för informationsteknologi2015Collecting and analysing data of analytical value is important for many service providers today. Many make use of NoSQL databases for their larger software systems, what is less known is how to effectively analyse and gather business intelligence from the data in these systems. This paper suggests a method of separating the most valuable analytical data from the rest in real time and at the same time providing an effective traditional database for the analyser. In this paper we analyse our given data sets to decide whether big data tools are required and then traditional databases are compared to see how well they fit the context. A technique that makes use of an asynchronous log- ging system is used to insert the data from the main system to the dedicated analytical database. The tests show that our technique can efficiently be used with a tra- ditional database even on large data sets (>1000000 insertions/hour per database node) and still provide both historical data and aggregate func- tions for the analyser. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-260394IT ; 15025application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
description Collecting and analysing data of analytical value is important for many service providers today. Many make use of NoSQL databases for their larger software systems, what is less known is how to effectively analyse and gather business intelligence from the data in these systems. This paper suggests a method of separating the most valuable analytical data from the rest in real time and at the same time providing an effective traditional database for the analyser. In this paper we analyse our given data sets to decide whether big data tools are required and then traditional databases are compared to see how well they fit the context. A technique that makes use of an asynchronous log- ging system is used to insert the data from the main system to the dedicated analytical database. The tests show that our technique can efficiently be used with a tra- ditional database even on large data sets (>1000000 insertions/hour per database node) and still provide both historical data and aggregate func- tions for the analyser.
author Klingsbo, Lukas
spellingShingle Klingsbo, Lukas
NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection
author_facet Klingsbo, Lukas
author_sort Klingsbo, Lukas
title NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection
title_short NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection
title_full NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection
title_fullStr NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection
title_full_unstemmed NoSQL: Moving from MapReduce Batch Jobs to Event-Driven Data Collection
title_sort nosql: moving from mapreduce batch jobs to event-driven data collection
publisher Uppsala universitet, Institutionen för informationsteknologi
publishDate 2015
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-260394
work_keys_str_mv AT klingsbolukas nosqlmovingfrommapreducebatchjobstoeventdrivendatacollection
_version_ 1716817278313431040