Distributed system fault tolerance using message logging and checkpointing

Fault tolerance can allow processes executing in a computer system to survive failures within the system. This thesis addresses the theory and practice of transparent fault-tolerance methods using message logging and checkpointing in distributed systems. A general model for reasoning about the behav...

Full description

Bibliographic Details
Main Author: Johnson, David Bruce
Other Authors: Zwaenepoel, Willy
Format: Others
Language:English
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/1911/16354

Similar Items