Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern
This paper conducts an in-depth analysis and research on the automatic selection and parameter configuration of the core components of Big Data software by using the retention model and the automatic selection of Big Data components by establishing a standardized requirement index and using the deci...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2021-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2021/6667275 |
id |
doaj-5455d38e55674ac0b85bde2000741454 |
---|---|
record_format |
Article |
spelling |
doaj-5455d38e55674ac0b85bde20007414542021-02-15T12:53:10ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472021-01-01202110.1155/2021/66672756667275Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention PatternPing Xu0School of Information Science and Technology, Taishan University, Taian, Shandong 271000, ChinaThis paper conducts an in-depth analysis and research on the automatic selection and parameter configuration of the core components of Big Data software by using the retention model and the automatic selection of Big Data components by establishing a standardized requirement index and using the decision tree model to solve the problem of component selection in Big Data application development. By establishing standardized demand indicators and based on the retention model, a data transmission intermediate platform for bidirectional data detection is proposed based on the three demands of user input: storage, computation, and analysis, as well as the problem of undetectable packet loss in data transmission of existing IoT and Web service platforms. The data communication module of the data transmission intermediate platform enables mutual monitoring and detection of data interaction between IoT smart terminals and cloud platforms. The retention mode is built separately to realize the automatic selection of Big Data components. In this paper, we start from several mainstream distributed storage systems and use Cassandra as an example for experiments and tests. We use the multiple regression fitting method to build a corresponding performance model for hardware parameters, take user requirements as input, and use the performance model to configure system hardware parameters; by studying its system principle, architecture, features, and application scenarios, we build a software parameter configuration knowledge base to guide the software. This solves the difficult problem of selecting, deploying, and configuring parameters for Big Data applications.http://dx.doi.org/10.1155/2021/6667275 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ping Xu |
spellingShingle |
Ping Xu Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern Mathematical Problems in Engineering |
author_facet |
Ping Xu |
author_sort |
Ping Xu |
title |
Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern |
title_short |
Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern |
title_full |
Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern |
title_fullStr |
Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern |
title_full_unstemmed |
Automatic Selection and Parameter Configuration of Big Data Software Core Components Based on Retention Pattern |
title_sort |
automatic selection and parameter configuration of big data software core components based on retention pattern |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1024-123X 1563-5147 |
publishDate |
2021-01-01 |
description |
This paper conducts an in-depth analysis and research on the automatic selection and parameter configuration of the core components of Big Data software by using the retention model and the automatic selection of Big Data components by establishing a standardized requirement index and using the decision tree model to solve the problem of component selection in Big Data application development. By establishing standardized demand indicators and based on the retention model, a data transmission intermediate platform for bidirectional data detection is proposed based on the three demands of user input: storage, computation, and analysis, as well as the problem of undetectable packet loss in data transmission of existing IoT and Web service platforms. The data communication module of the data transmission intermediate platform enables mutual monitoring and detection of data interaction between IoT smart terminals and cloud platforms. The retention mode is built separately to realize the automatic selection of Big Data components. In this paper, we start from several mainstream distributed storage systems and use Cassandra as an example for experiments and tests. We use the multiple regression fitting method to build a corresponding performance model for hardware parameters, take user requirements as input, and use the performance model to configure system hardware parameters; by studying its system principle, architecture, features, and application scenarios, we build a software parameter configuration knowledge base to guide the software. This solves the difficult problem of selecting, deploying, and configuring parameters for Big Data applications. |
url |
http://dx.doi.org/10.1155/2021/6667275 |
work_keys_str_mv |
AT pingxu automaticselectionandparameterconfigurationofbigdatasoftwarecorecomponentsbasedonretentionpattern |
_version_ |
1714866481881153536 |