Distributed system fault tolerance using message logging and checkpointing
Fault tolerance can allow processes executing in a computer system to survive failures within the system. This thesis addresses the theory and practice of transparent fault-tolerance methods using message logging and checkpointing in distributed systems. A general model for reasoning about the behav...
Main Author: | Johnson, David Bruce |
---|---|
Other Authors: | Zwaenepoel, Willy |
Format: | Others |
Language: | English |
Published: | 2009 |
Online Access: | http://hdl.handle.net/1911/16354 |
Similar Items
- Authenticated messages for a real-time fault-tolerant computer system
  by: Chau, David Chi-Shing
  Published: (2007)
- A message passing system for a fault tolerant parallel processor
  by: Heyda, Russell Lawrence
  Published: (2005)
- Manetho: Fault tolerance in distributed systems using rollback-recovery and process replication
  by: Elnozahy, Elmootazbellah Nabil
  Published: (2009)
- Communication-Induced Checkpointing with Message Logging beyond the Piecewise Deterministic (PWD) Model for Distributed Systems
  by: Jinho Ahn
  Published: (2021-06-01)
- The design of fault tolerant software for loosely coupled distributed systems
  by: Tyrrell, Andrew M.
  Published: (1987)