Stateless Node Failure Information Propagation Scheme for Stable Overlay Networks

A structured overlay technology has the advantages for fault tolerance and computation resource (i.e., node) discovery in distributed data storage and its computation platform, however, these strengths are only guaranteed on stable environment that node failures do not occur frequently. To deal with...

Full description

Bibliographic Details
Main Author: Kimihiro Mizutani
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9458243/
id doaj-7ec960c917ac4504b463f4feccdc232c
record_format Article
spelling doaj-7ec960c917ac4504b463f4feccdc232c2021-06-24T23:00:13ZengIEEEIEEE Access2169-35362021-01-019887378874510.1109/ACCESS.2021.30900289458243Stateless Node Failure Information Propagation Scheme for Stable Overlay NetworksKimihiro Mizutani0https://orcid.org/0000-0003-2020-6578Department of Informatics and Cyber Informatics Research Institute, Kindai University, Osaka, JapanA structured overlay technology has the advantages for fault tolerance and computation resource (i.e., node) discovery in distributed data storage and its computation platform, however, these strengths are only guaranteed on stable environment that node failures do not occur frequently. To deal with the environment, many advanced schemes based on the well-known node failure information propagation scheme are proposed, which stabilizes the platform by quickly handling node failures. In the existing scheme, a computation node propagates a node-failure information when the node detect its failure. However, the existing scheme needs stateful maintenance against propagation targets; in other words, it must maintain the network connections of both the propagation target nodes and the nodes held on the general overlay. The nodes then exhaust the machine resources (e.g., CPU, memory, network bandwidth) for the connection management and cannot concentrates on their own tasks, such as data analysis or its storage application. To resolve this problem, I propose a stateless node-failure information propagation scheme, which propagates a node failure at the speed of the existing scheme but without requiring maintenance of the propagation target connections. In the proposed scheme, each computational node can effectively utilize the machine resources for its own task. Instead of retaining the propagation targets, my scheme estimates the propagation targets after detecting a node failure. I analyzed the estimation accuracy of a simple propagation model, which guarantees effective propagation. The accuracy was found to depend on the overlay distance between the failed node and the propagator node. Based on this observation, my scheme adjusts the keep-alive interval to bias the detection of closer node failures. In a simulation evaluation, the detection delay of the proposed stateless propagation was similar to that of the stateful propagation scheme, but delivered superior maintenance cost and scalability.https://ieeexplore.ieee.org/document/9458243/Peer-to-peer computingoverlay networksfault tolerance
collection DOAJ
language English
format Article
sources DOAJ
author Kimihiro Mizutani
spellingShingle Kimihiro Mizutani
Stateless Node Failure Information Propagation Scheme for Stable Overlay Networks
IEEE Access
Peer-to-peer computing
overlay networks
fault tolerance
author_facet Kimihiro Mizutani
author_sort Kimihiro Mizutani
title Stateless Node Failure Information Propagation Scheme for Stable Overlay Networks
title_short Stateless Node Failure Information Propagation Scheme for Stable Overlay Networks
title_full Stateless Node Failure Information Propagation Scheme for Stable Overlay Networks
title_fullStr Stateless Node Failure Information Propagation Scheme for Stable Overlay Networks
title_full_unstemmed Stateless Node Failure Information Propagation Scheme for Stable Overlay Networks
title_sort stateless node failure information propagation scheme for stable overlay networks
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description A structured overlay technology has the advantages for fault tolerance and computation resource (i.e., node) discovery in distributed data storage and its computation platform, however, these strengths are only guaranteed on stable environment that node failures do not occur frequently. To deal with the environment, many advanced schemes based on the well-known node failure information propagation scheme are proposed, which stabilizes the platform by quickly handling node failures. In the existing scheme, a computation node propagates a node-failure information when the node detect its failure. However, the existing scheme needs stateful maintenance against propagation targets; in other words, it must maintain the network connections of both the propagation target nodes and the nodes held on the general overlay. The nodes then exhaust the machine resources (e.g., CPU, memory, network bandwidth) for the connection management and cannot concentrates on their own tasks, such as data analysis or its storage application. To resolve this problem, I propose a stateless node-failure information propagation scheme, which propagates a node failure at the speed of the existing scheme but without requiring maintenance of the propagation target connections. In the proposed scheme, each computational node can effectively utilize the machine resources for its own task. Instead of retaining the propagation targets, my scheme estimates the propagation targets after detecting a node failure. I analyzed the estimation accuracy of a simple propagation model, which guarantees effective propagation. The accuracy was found to depend on the overlay distance between the failed node and the propagator node. Based on this observation, my scheme adjusts the keep-alive interval to bias the detection of closer node failures. In a simulation evaluation, the detection delay of the proposed stateless propagation was similar to that of the stateful propagation scheme, but delivered superior maintenance cost and scalability.
topic Peer-to-peer computing
overlay networks
fault tolerance
url https://ieeexplore.ieee.org/document/9458243/
work_keys_str_mv AT kimihiromizutani statelessnodefailureinformationpropagationschemeforstableoverlaynetworks
_version_ 1721360978622283776