Parallelism Strategies for Big Data Delayed Transfer Entropy Evaluation

Generated and collected data have been rising with the popularization of technologies such as Internet of Things, social media, and smartphone, leading big data term creation. One class of big data hidden information is causality. Among the tools to infer causal relationships, there is Delay Transfe...

Full description

Bibliographic Details
Main Authors: Jonas R. Dourado, Jordão Natal de Oliveira Júnior, Carlos D. Maciel
Format: Article
Language:English
Published: MDPI AG 2019-09-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/12/9/190
id doaj-8368980021794fadadd3557598accfa3
record_format Article
spelling doaj-8368980021794fadadd3557598accfa32020-11-25T01:08:56ZengMDPI AGAlgorithms1999-48932019-09-0112919010.3390/a12090190a12090190Parallelism Strategies for Big Data Delayed Transfer Entropy EvaluationJonas R. Dourado0Jordão Natal de Oliveira Júnior1Carlos D. Maciel2Department of Electrical and Computational Engineering, University of São Paulo, 13566-590 São Carlos-SP, BrazilDepartment of Electrical and Computational Engineering, University of São Paulo, 13566-590 São Carlos-SP, BrazilDepartment of Electrical and Computational Engineering, University of São Paulo, 13566-590 São Carlos-SP, BrazilGenerated and collected data have been rising with the popularization of technologies such as Internet of Things, social media, and smartphone, leading big data term creation. One class of big data hidden information is causality. Among the tools to infer causal relationships, there is Delay Transfer Entropy (DTE); however, it has a high demanding processing power. Many approaches were proposed to overcome DTE performance issues such as GPU and FPGA implementations. Our study compared different parallel strategies to calculate DTE from big data series using a heterogeneous Beowulf cluster. Task Parallelism was significantly faster in comparison to Data Parallelism. With big data trend in sight, these results may enable bigger datasets analysis or better statistical evidence.https://www.mdpi.com/1999-4893/12/9/190delayed transfer entropyparallelism strategiesbig data analysisheterogeneous computer clustercomplex systemscausalitysurrogate
collection DOAJ
language English
format Article
sources DOAJ
author Jonas R. Dourado
Jordão Natal de Oliveira Júnior
Carlos D. Maciel
spellingShingle Jonas R. Dourado
Jordão Natal de Oliveira Júnior
Carlos D. Maciel
Parallelism Strategies for Big Data Delayed Transfer Entropy Evaluation
Algorithms
delayed transfer entropy
parallelism strategies
big data analysis
heterogeneous computer cluster
complex systems
causality
surrogate
author_facet Jonas R. Dourado
Jordão Natal de Oliveira Júnior
Carlos D. Maciel
author_sort Jonas R. Dourado
title Parallelism Strategies for Big Data Delayed Transfer Entropy Evaluation
title_short Parallelism Strategies for Big Data Delayed Transfer Entropy Evaluation
title_full Parallelism Strategies for Big Data Delayed Transfer Entropy Evaluation
title_fullStr Parallelism Strategies for Big Data Delayed Transfer Entropy Evaluation
title_full_unstemmed Parallelism Strategies for Big Data Delayed Transfer Entropy Evaluation
title_sort parallelism strategies for big data delayed transfer entropy evaluation
publisher MDPI AG
series Algorithms
issn 1999-4893
publishDate 2019-09-01
description Generated and collected data have been rising with the popularization of technologies such as Internet of Things, social media, and smartphone, leading big data term creation. One class of big data hidden information is causality. Among the tools to infer causal relationships, there is Delay Transfer Entropy (DTE); however, it has a high demanding processing power. Many approaches were proposed to overcome DTE performance issues such as GPU and FPGA implementations. Our study compared different parallel strategies to calculate DTE from big data series using a heterogeneous Beowulf cluster. Task Parallelism was significantly faster in comparison to Data Parallelism. With big data trend in sight, these results may enable bigger datasets analysis or better statistical evidence.
topic delayed transfer entropy
parallelism strategies
big data analysis
heterogeneous computer cluster
complex systems
causality
surrogate
url https://www.mdpi.com/1999-4893/12/9/190
work_keys_str_mv AT jonasrdourado parallelismstrategiesforbigdatadelayedtransferentropyevaluation
AT jordaonataldeoliveirajunior parallelismstrategiesforbigdatadelayedtransferentropyevaluation
AT carlosdmaciel parallelismstrategiesforbigdatadelayedtransferentropyevaluation
_version_ 1725180896099696640