A System for Remote Multi-Program Checkpointing and Error Recovery

Master's === National Chung Hsing University === Department of Electrical Engineering === 95 === Checkpointing as a fault-tolerance technique has been widely discussed in the research literature, especially in the design and implementation of communication and database systems. The basic idea of a checkpoint is to preserve the state and data of the target running process at the...

Bibliographic Details
Main Authors: Sheng-Kuei Wang, 王聖貴
Other Authors: 蔡智強
Format: Others
Language: zh-TW
Online Access: http://ndltd.ncl.edu.tw/handle/99811736148925090070
id ndltd-TW-095NCHU5441039
record_format oai_dc
spelling ndltd-TW-095NCHU5441039 2017-07-09T04:29:41Z http://ndltd.ncl.edu.tw/handle/99811736148925090070 A System for Remote Multi-Program Checkpointing and Error Recovery 遠端多程式檢查點與錯誤回復之系統 Sheng-Kuei Wang 王聖貴 Master's National Chung Hsing University Department of Electrical Engineering 95 蔡智強 thesis 49 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Chung Hsing University === Department of Electrical Engineering === 95 === Checkpointing as a fault-tolerance technique has been widely discussed in the research literature, especially in the design and implementation of communication and database systems. The basic idea of a checkpoint is to preserve the state and data of a running target process at a given moment so that they can be restored after the process fails. Checkpointing and error recovery can be implemented in software or in hardware, and combining the two is also common. However, software implementations of checkpointing and error recovery almost always choose Unix/Linux as the platform; implementations on Microsoft Windows are comparatively rare. This thesis implements a checkpoint and rollback-recovery mechanism in software on the Windows platform. The implementation is divided into two parts: one performs the checkpoint and error-recovery computation, and the other intercepts system API calls. The thesis also implements multi-program checkpointing and periodic checkpointing. In multi-program checkpointing, the checkpoint information is saved in a file associated with each program; in periodic checkpointing, the remote console provides a sample interface through which the user can choose different checkpoint intervals for different application programs. Finally, by combining TCP/IP networking with a remote checkpointing server, the main goal is to build a basic client/server checkpointing model for a Windows network environment.
author2 蔡智強
author Sheng-Kuei Wang
王聖貴
title A System for Remote Multi-Program Checkpointing and Error Recovery
url http://ndltd.ncl.edu.tw/handle/99811736148925090070