Storage-Optimized Data-Atomic Algorithms for Handling Erasures and Errors in Distributed Storage Systems

Erasure codes are increasingly being studied in the context of implementing atomic memory objects in large scale asynchronous distributed storage systems. When compared with the traditional replication based schemes, erasure codes have the potential of significantly lowering storage and communicatio...

Full description

Bibliographic Details
Main Authors: Kantor, Erez (Author), Schwarzmann, Alexander A. (Author), Konwar, Kishori Mohan (Contributor), Prakash, N. (Contributor), Lynch, Nancy Ann (Contributor), Medard, Muriel (Contributor)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers (IEEE), 2017-12-08T22:29:21Z.
Subjects:
Online Access:Get fulltext
LEADER 02926 am a22002773u 4500
001 112674
042 |a dc 
100 1 0 |a Kantor, Erez  |e author 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
100 1 0 |a Konwar, Kishori Mohan  |e contributor 
100 1 0 |a Prakash, N.  |e contributor 
100 1 0 |a Lynch, Nancy Ann  |e contributor 
100 1 0 |a Medard, Muriel  |e contributor 
700 1 0 |a Schwarzmann, Alexander A.  |e author 
700 1 0 |a Konwar, Kishori Mohan  |e author 
700 1 0 |a Prakash, N.  |e author 
700 1 0 |a Lynch, Nancy Ann  |e author 
700 1 0 |a Medard, Muriel  |e author 
245 0 0 |a Storage-Optimized Data-Atomic Algorithms for Handling Erasures and Errors in Distributed Storage Systems 
260 |b Institute of Electrical and Electronics Engineers (IEEE),   |c 2017-12-08T22:29:21Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/112674 
520 |a Erasure codes are increasingly being studied in the context of implementing atomic memory objects in large scale asynchronous distributed storage systems. When compared with the traditional replication based schemes, erasure codes have the potential of significantly lowering storage and communication costs while simultaneously guaranteeing the desired resiliency levels. In this work, we propose the Storage-Optimized Data-Atomic (SODA) algorithm for implementing atomic memory objects in the multi-writer multi-reader setting. SODA uses Maximum Distance Separable (MDS) codes, and is specifically designed to optimize the total storage cost for a given fault-tolerance requirement. For tolerating f server crashes in an n-server system, SODA uses an [n, k] MDS code with k = n - f, and incurs a total storage cost of n/n-f. SODA is designed under the assumption of reliable point-to-point communication channels. The communication cost of a write and a read operation are respectively given by O(f[superscript 2]) and n/n-f(δ[subscript w]+1), where δ[subscript w] denotes the number of writes that are concurrent with the particular read. In comparison with the recent CASGC algorithm [1], which also uses MDS codes, SODA offers lower storage cost while pays more on the communication cost. We also present a modification of SODA, called SODA[subscript err], to handle the case where some of the servers can return erroneous coded elements during a read operation. Specifically, in order to tolerate f server failures and e error-prone coded elements, the SODAerr algorithm uses an [n, k] MDS code such that k = n - 2e - f. SODAerr also guarantees liveness and atomicity, while maintaining an optimized total storage cost of n/n-f-2e. 
546 |a en_US 
655 7 |a Article 
773 |t 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS)