Handling Data Flows of Streaming Internet of Things Data

Streaming data in various formats is generated at high speed and needs to be processed and analyzed before it becomes useless. Existing technology provides tools to process this data and extract more meaningful information from it. This thesis has two parts: theoret...


Bibliographic Details
Main Author: Serbessa, Yonatan Kebede
Format: Others
Language: English
Published: Uppsala universitet, Institutionen för informationsteknologi 2016
Online Access: http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-302102
id ndltd-UPSALLA1-oai-DiVA.org-uu-302102
record_format oai_dc
collection NDLTD
language English
format Others
sources NDLTD
description Streaming data in various formats is generated at high speed and needs to be processed and analyzed before it becomes useless. Existing technology provides tools to process this data and extract more meaningful information from it. This thesis has two parts: theoretical and practical. The theoretical part investigates which tools are suitable for processing and analyzing streaming data flows. It starts by studying one of the main sources of streaming data that produces large volumes of data: the Internet of Things. The technologies behind it, common use cases, challenges, and solutions are studied. This is followed by an overview of selected tools, namely Apache NiFi, Apache Spark Streaming, and Apache Storm, covering their key features, main components, and architecture. After the tools are studied, five parameters are selected to review how each tool handles them. This can be useful when choosing a tool for a given set of parameters and the use case at hand. The second part of the thesis involves Twitter data analysis done using Apache NiFi, one of the tools studied. The purpose is to show how NiFi can be used to process data from ingestion through to sending it to storage systems, and how it communicates with external storage, search, and indexing systems.
author Serbessa, Yonatan Kebede
title Handling Data Flows of Streaming Internet of Things Data
publisher Uppsala universitet, Institutionen för informationsteknologi
publishDate 2016
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-302102