Checkpointing Algorithms for Parallel Computers
Checkpointing is a technique widely used in parallel/distributed computers for rollback error recovery. Checkpointing is defined as the coordinated saving of process state information at specified time instances. Checkpoints help in restoring the computation from the latest saved state, in case of f...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | en |
Published: |
Indian Institute of Science
2005
|
Subjects: | |
Online Access: | http://hdl.handle.net/2005/67 |