ALGORITHMS FOR FAULT TOLERANCE IN DISTRIBUTED SYSTEMS AND ROUTING IN AD HOC NETWORKS

Checkpointing and rollback recovery are well-known techniques for coping with failures in distributed systems. Future generation Supercomputers will be message passing distributed systems consisting of millions of processors. As the number of processors grow, failure rate also grows. Thus, designing...

Full description

Bibliographic Details
Main Author: Jiang, Qiangfeng
Format: Others
Published: UKnowledge 2013
Subjects:
Online Access:http://uknowledge.uky.edu/cs_etds/16
http://uknowledge.uky.edu/cgi/viewcontent.cgi?article=1018&context=cs_etds