Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern

This paper studies the automatic selection and parameter configuration of the core components of Big Data software using the retention model, and the automatic selection of Big Data components by establishing a standardized requirement index and using the deci...


Bibliographic Details
Main Author: Ping Xu
Format: Article
Language: English
Published: Hindawi Limited 2021-01-01
Series: Mathematical Problems in Engineering
Online Access: http://dx.doi.org/10.1155/2021/6667275
id doaj-5455d38e55674ac0b85bde2000741454
record_format Article
spelling doaj-5455d38e55674ac0b85bde2000741454; 2021-02-15T12:53:10Z; eng; Hindawi Limited; Mathematical Problems in Engineering; 1024-123X; 1563-5147; 2021-01-01; 2021; 10.1155/2021/6667275; 6667275; Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern; Ping Xu (School of Information Science and Technology, Taishan University, Taian, Shandong 271000, China); http://dx.doi.org/10.1155/2021/6667275
collection DOAJ
language English
format Article
sources DOAJ
author Ping Xu
title Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1024-123X, 1563-5147
publishDate 2021-01-01
description This paper studies the automatic selection and parameter configuration of the core components of Big Data software using the retention model. To solve the problem of component selection in Big Data application development, it establishes a standardized requirement index and applies a decision tree model to select components automatically. Based on the standardized demand indicators and the retention model, and starting from the three demands of user input (storage, computation, and analysis), the paper also addresses the problem of undetectable packet loss in data transmission on existing IoT and Web service platforms by proposing a data transmission intermediate platform for bidirectional data detection. The data communication module of this platform enables mutual monitoring and detection of data interaction between IoT smart terminals and cloud platforms. The retention model is built separately to realize the automatic selection of Big Data components. For parameter configuration, the paper starts from several mainstream distributed storage systems and uses Cassandra as the example for experiments and tests. For hardware parameters, a performance model is built by multiple regression fitting; user requirements are taken as input, and the performance model is used to configure the system hardware parameters. For software parameters, a configuration knowledge base is built by studying the system's principle, architecture, features, and application scenarios, and is used to guide software parameter configuration. Together, these address the difficult problem of selecting, deploying, and configuring parameters for Big Data applications.
url http://dx.doi.org/10.1155/2021/6667275
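
The hardware-parameter step described in the abstract (fit a performance model by multiple regression, then use it to pick a configuration that meets a user requirement) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and does not come from the article: the two features (CPU cores, memory in GB), the throughput figures (synthetic, generated from 100 + 50 * cores + 10 * memory), and the helper names `fit_multiple_regression`, `predict`, and `cheapest_config`.

```python
from typing import List, Optional, Sequence, Tuple

# Hypothetical benchmark samples: ((cpu_cores, memory_gb), throughput_ops_per_s).
# The throughput values are synthetic, not measurements from the article.
SAMPLES: List[Tuple[Tuple[float, float], float]] = [
    ((2, 4), 240.0),
    ((4, 4), 340.0),
    ((4, 8), 380.0),
    ((8, 8), 580.0),
    ((8, 16), 660.0),
]


def fit_multiple_regression(
    X: Sequence[Sequence[float]], y: Sequence[float]
) -> List[float]:
    """Ordinary least squares via the normal equations.

    Returns [intercept, b1, ..., bk] for the model y = intercept + sum(bi * xi).
    """
    rows = [[1.0, *map(float, r)] for r in X]  # prepend the intercept column
    n = len(rows[0])
    # Augmented matrix [A^T A | A^T y].
    aug = [
        [sum(r[i] * r[j] for r in rows) for j in range(n)]
        + [sum(r[i] * t for r, t in zip(rows, y))]
        for i in range(n)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("features are collinear; cannot fit the model")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][n] for i in range(n)]


def predict(coeffs: Sequence[float], features: Sequence[float]) -> float:
    """Evaluate the fitted linear performance model for one configuration."""
    return coeffs[0] + sum(c * x for c, x in zip(coeffs[1:], features))


def cheapest_config(
    coeffs: Sequence[float],
    candidates: Sequence[Sequence[float]],
    required: float,
) -> Optional[Sequence[float]]:
    """Return the first candidate whose predicted throughput meets the requirement.

    Candidates are assumed to be ordered from cheapest to most expensive.
    Returns None when no candidate is predicted to be sufficient.
    """
    for cand in candidates:
        if predict(coeffs, cand) >= required:
            return cand
    return None


if __name__ == "__main__":
    features = [f for f, _ in SAMPLES]
    targets = [t for _, t in SAMPLES]
    model = fit_multiple_regression(features, targets)
    print("coefficients:", [round(c, 3) for c in model])
    print("chosen config:", cheapest_config(model, features, required=500.0))
```

Because the synthetic samples are exactly linear, the fit recovers the generating coefficients up to floating-point error; with real benchmark data the residuals would need to be checked before trusting the model for configuration decisions.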