Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket
This thesis investigates two alternative approaches, referred to as automatic- and semi-automatic replay, which can be used to prevent data loss due to a certain set of unforeseen events at Bolagsverket, the Swedish Companies Registration Office. The approaches make it possible to recover the correc...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
Mittuniversitetet, Avdelningen för informations- och kommunikationssystem
2013
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-20901 |
id |
ndltd-UPSALLA1-oai-DiVA.org-miun-20901 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-miun-209012018-01-12T05:12:36ZPreventing data loss using rollback-recovery : A proof-of-concept study at BolagsverketengSjölinder, MaxMittuniversitetet, Avdelningen för informations- och kommunikationssystem2013Fault toleranceRollback-recoveryData lossDatabaseBolagsverketComputer and Information SciencesData- och informationsvetenskapSoftware EngineeringProgramvaruteknikComputer EngineeringDatorteknikThis thesis investigates two alternative approaches, referred to as automatic- and semi-automatic replay, which can be used to prevent data loss due to a certain set of unforeseen events at Bolagsverket, the Swedish Companies Registration Office. The approaches make it possible to recover the correct data from a database that belongs to a stateless distributed system and that contains erroneous- or inaccurate information due to past faults. Both approaches utilize log-based rollback-recovery techniques but make different assumptions regarding the deterministic behaviour of Bolagsverket’s systems. A stateless distributed system logs all received messages during failure-free operation. During recovery, automatic replay recovers the data by enabling the system to re-process the logged messages. In contrast, semi-automatic replay recovers data by utilizing the logged messages to enable officials at Bolagsverket to manually redo lost work in a controlled manner. Proof-of-concept implementations of the two replay approaches are developed on a simplified model that resembles one of Bolagsverket’s electronic services, yet that is general to any stateless system that communicates asynchronously using JMS messages and synchronously using XML sent over HTTP. The theoretical- and performance evaluation was conducted with the aim of producing results general to any system with similar characteristics to those of the model. The results suggest that the failure-free overhead at Bolagsverket is approximately 100 milliseconds per logged message, and that around 3 gigabytes of data must be stored in order to recover one average day’s operation. Further, automatic replay successfully manages to recover one average day’s operation in around 70 minutes. Semi-automatic replay is calculated to require, at a maximum, one workday to recover the same amount of data. It is assessed that automatic replay is a suitable solution for Bolagsverket if it is proven that their systems are fully deterministic. In other cases, it is assessed that semi-automatic replay can be utilized. It is however recommended that further evaluations are conducted before the approaches are implemented in a production environment. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-20901application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Fault tolerance Rollback-recovery Data loss Database Bolagsverket Computer and Information Sciences Data- och informationsvetenskap Software Engineering Programvaruteknik Computer Engineering Datorteknik |
spellingShingle |
Fault tolerance Rollback-recovery Data loss Database Bolagsverket Computer and Information Sciences Data- och informationsvetenskap Software Engineering Programvaruteknik Computer Engineering Datorteknik Sjölinder, Max Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket |
description |
This thesis investigates two alternative approaches, referred to as automatic- and semi-automatic replay, which can be used to prevent data loss due to a certain set of unforeseen events at Bolagsverket, the Swedish Companies Registration Office. The approaches make it possible to recover the correct data from a database that belongs to a stateless distributed system and that contains erroneous- or inaccurate information due to past faults. Both approaches utilize log-based rollback-recovery techniques but make different assumptions regarding the deterministic behaviour of Bolagsverket’s systems. A stateless distributed system logs all received messages during failure-free operation. During recovery, automatic replay recovers the data by enabling the system to re-process the logged messages. In contrast, semi-automatic replay recovers data by utilizing the logged messages to enable officials at Bolagsverket to manually redo lost work in a controlled manner. Proof-of-concept implementations of the two replay approaches are developed on a simplified model that resembles one of Bolagsverket’s electronic services, yet that is general to any stateless system that communicates asynchronously using JMS messages and synchronously using XML sent over HTTP. The theoretical- and performance evaluation was conducted with the aim of producing results general to any system with similar characteristics to those of the model. The results suggest that the failure-free overhead at Bolagsverket is approximately 100 milliseconds per logged message, and that around 3 gigabytes of data must be stored in order to recover one average day’s operation. Further, automatic replay successfully manages to recover one average day’s operation in around 70 minutes. Semi-automatic replay is calculated to require, at a maximum, one workday to recover the same amount of data. It is assessed that automatic replay is a suitable solution for Bolagsverket if it is proven that their systems are fully deterministic. In other cases, it is assessed that semi-automatic replay can be utilized. It is however recommended that further evaluations are conducted before the approaches are implemented in a production environment. |
author |
Sjölinder, Max |
author_facet |
Sjölinder, Max |
author_sort |
Sjölinder, Max |
title |
Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket |
title_short |
Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket |
title_full |
Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket |
title_fullStr |
Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket |
title_full_unstemmed |
Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket |
title_sort |
preventing data loss using rollback-recovery : a proof-of-concept study at bolagsverket |
publisher |
Mittuniversitetet, Avdelningen för informations- och kommunikationssystem |
publishDate |
2013 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-20901 |
work_keys_str_mv |
AT sjolindermax preventingdatalossusingrollbackrecoveryaproofofconceptstudyatbolagsverket |
_version_ |
1718606526067769344 |