Comparative Analysis of Skew-Join Strategies for Large-Scale Datasets with MapReduce and Spark
In the era of data deluge, Big Data gradually offers numerous opportunities, but also poses significant challenges to conventional data processing and analysis methods. MapReduce has become a prominent parallel and distributed programming model for efficiently handling such massive datasets. One of...
Main Authors: | Cao, H.-P (Author), Phan, A.-C (Author), Phan, T.-C (Author), Trieu, T.-N (Author) |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI
2022
|
Subjects: | |
Online Access: | View Fulltext in Publisher |
Similar Items
-
Optimization for big joins and recursive query evaluation using intersection and difference filters in MapReduce
by: Phan, Thuong-Cang
Published: (2014) -
Time Estimation and Resource Minimization Scheme for Apache Spark and Hadoop Big Data Systems With Failures
by: Jinbae Lee, et al.
Published: (2019-01-01) -
Large Scale Implementations for Twitter Sentiment Classification
by: Andreas Kanavos, et al.
Published: (2017-03-01) -
MAPSkew: Metaheuristic Approaches for Partitioning Skew in MapReduce
by: Matheus H. M. Pericini, et al.
Published: (2018-12-01) -
Skewness-Based Partitioning in SpatialHadoop
by: Alberto Belussi, et al.
Published: (2020-03-01)