Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments.

As data volumes and the use of distributed systems for data storage and processing have grown, reducing the number of replications has become a crucial requirement in these systems, and it has been addressed by a large body of research. In this paper, an algorithm has been proposed to reduce th...

Bibliographic Details
Main Authors:	Khatereh Sabaghian, Keyhan Khamforoosh, Abdolbaghi Ghaderzadeh
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2021-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0254210

id	doaj-322aa71e314042998c27589bc8e76701
record_format	Article
spelling	doaj-322aa71e314042998c27589bc8e76701; 2021-07-24T04:32:38Z; eng; Public Library of Science (PLoS); PLoS ONE; 1932-6203; 2021-01-01; 16; 7; e0254210; 10.1371/journal.pone.0254210; Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments.; Khatereh Sabaghian; Keyhan Khamforoosh; Abdolbaghi Ghaderzadeh; https://doi.org/10.1371/journal.pone.0254210
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Khatereh Sabaghian, Keyhan Khamforoosh, Abdolbaghi Ghaderzadeh
title	Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2021-01-01
description	As data volumes and the use of distributed systems for data storage and processing have grown, reducing the number of replications has become a crucial requirement in these systems, and it has been addressed by a large body of research. In this paper, an algorithm is proposed to reduce the number of replications in big data transfer and, ultimately, to lower the traffic load over the grid. It does so by classifying data efficiently according to the types of data sent and by using VIKOR, a multivariate decision-making method, to rank replication sites. Because it considers several variables at once, the VIKOR method makes it possible to take into account all the parameters that affect the assessment of site ranks. According to the results and evaluations, the proposed method exhibits an average improvement of about thirty percent over the LRU, LFU, BHR, and Without Rep. algorithms. Furthermore, it improves by thirty percent on existing multivariate methods that take different approaches to replication, because it considers effective parameters such as time, the number of replications, and the replication site, so that replication occurs only when it can improve access.
url	https://doi.org/10.1371/journal.pone.0254210
_version_	1721284333609680896
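
The description field above names VIKOR as the method used to rank replication sites. As a rough illustration of how a generic VIKOR ranking works, here is a minimal Python sketch. It is not the authors' implementation: the function name vikor_rank, the three criteria (access time, replica count, free storage), the weights, and the sample values are all hypothetical and chosen only for this example.

```python
from typing import List, Sequence, Tuple


def vikor_rank(
    matrix: Sequence[Sequence[float]],
    weights: Sequence[float],
    benefit: Sequence[bool],
    v: float = 0.5,
) -> Tuple[List[float], List[int]]:
    """Generic VIKOR ranking (hypothetical helper, not taken from the paper).

    matrix[i][j] is the score of alternative i on criterion j.
    benefit[j] is True when larger values are better, False for cost criteria.
    v weights group utility (S) against individual regret (R).
    Returns the Q index per alternative and the indices sorted best-first.
    """
    n_crit = len(weights)

    # Best and worst value per criterion, depending on benefit or cost.
    best, worst = [], []
    for j in range(n_crit):
        col = [row[j] for row in matrix]
        best.append(max(col) if benefit[j] else min(col))
        worst.append(min(col) if benefit[j] else max(col))

    # S: weighted sum of normalized distances to the best value.
    # R: largest single weighted distance (the maximum regret).
    s_vals, r_vals = [], []
    for row in matrix:
        terms = []
        for j in range(n_crit):
            span = best[j] - worst[j]
            dist = 0.0 if span == 0 else (best[j] - row[j]) / span
            terms.append(weights[j] * dist)
        s_vals.append(sum(terms))
        r_vals.append(max(terms))

    def scale(vals: List[float]) -> List[float]:
        # Min-max scale to [0, 1]; a constant list maps to all zeros.
        lo, hi = min(vals), max(vals)
        return [0.0 if hi == lo else (x - lo) / (hi - lo) for x in vals]

    # Q combines the scaled S and R; lower Q means a better compromise.
    q_vals = [v * s + (1 - v) * r for s, r in zip(scale(s_vals), scale(r_vals))]
    order = sorted(range(len(matrix)), key=lambda i: q_vals[i])
    return q_vals, order


# Hypothetical candidate sites: [access time, replica count, free storage].
sites = [
    [10.0, 2.0, 500.0],
    [20.0, 4.0, 100.0],
    [15.0, 3.0, 300.0],
]
q, order = vikor_rank(sites, weights=[0.5, 0.3, 0.2], benefit=[False, False, True])
print(order)
```

In this sketch the first two criteria are treated as costs and the third as a benefit, and the site with the lowest Q value is ranked first. The full VIKOR procedure also checks acceptable-advantage and stability conditions before accepting the top-ranked alternative; those checks are omitted here for brevity.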

Similar Items