ALGORITHMS FOR FAULT TOLERANCE IN DISTRIBUTED SYSTEMS AND ROUTING IN AD HOC NETWORKS
Checkpointing and rollback recovery are well-known techniques for coping with failures in distributed systems. Future generation Supercomputers will be message passing distributed systems consisting of millions of processors. As the number of processors grow, failure rate also grows. Thus, designing...
Main Author: | |
---|---|
Format: | Others |
Published: |
UKnowledge
2013
|
Subjects: | |
Online Access: | http://uknowledge.uky.edu/cs_etds/16 http://uknowledge.uky.edu/cgi/viewcontent.cgi?article=1018&context=cs_etds |