Parallelism Strategies for Big Data Delayed Transfer Entropy Evaluation

Generated and collected data have been rising with the popularization of technologies such as Internet of Things, social media, and smartphone, leading big data term creation. One class of big data hidden information is causality. Among the tools to infer causal relationships, there is Delay Transfe...

Full description

Bibliographic Details
Main Authors: Jonas R. Dourado, Jordão Natal de Oliveira Júnior, Carlos D. Maciel
Format: Article
Language:English
Published: MDPI AG 2019-09-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/12/9/190
Description
Summary:Generated and collected data have been rising with the popularization of technologies such as Internet of Things, social media, and smartphone, leading big data term creation. One class of big data hidden information is causality. Among the tools to infer causal relationships, there is Delay Transfer Entropy (DTE); however, it has a high demanding processing power. Many approaches were proposed to overcome DTE performance issues such as GPU and FPGA implementations. Our study compared different parallel strategies to calculate DTE from big data series using a heterogeneous Beowulf cluster. Task Parallelism was significantly faster in comparison to Data Parallelism. With big data trend in sight, these results may enable bigger datasets analysis or better statistical evidence.
ISSN:1999-4893