Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket

This thesis investigates two alternative approaches, referred to as automatic- and semi-automatic replay, which can be used to prevent data loss due to a certain set of unforeseen events at Bolagsverket, the Swedish Companies Registration Office. The approaches make it possible to recover the correc...

Full description

Bibliographic Details
Main Author: Sjölinder, Max
Format: Others
Language:English
Published: Mittuniversitetet, Avdelningen för informations- och kommunikationssystem 2013
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-20901
id ndltd-UPSALLA1-oai-DiVA.org-miun-20901
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-miun-209012018-01-12T05:12:36ZPreventing data loss using rollback-recovery : A proof-of-concept study at BolagsverketengSjölinder, MaxMittuniversitetet, Avdelningen för informations- och kommunikationssystem2013Fault toleranceRollback-recoveryData lossDatabaseBolagsverketComputer and Information SciencesData- och informationsvetenskapSoftware EngineeringProgramvaruteknikComputer EngineeringDatorteknikThis thesis investigates two alternative approaches, referred to as automatic- and semi-automatic replay, which can be used to prevent data loss due to a certain set of unforeseen events at Bolagsverket, the Swedish Companies Registration Office. The approaches make it possible to recover the correct data from a database that belongs to a stateless distributed system and that contains erroneous- or inaccurate information due to past faults. Both approaches utilize log-based rollback-recovery techniques but make different assumptions regarding the deterministic behaviour of Bolagsverket’s systems. A stateless distributed system logs all received messages during failure-free operation. During recovery, automatic replay recovers the data by enabling the system to re-process the logged messages. In contrast, semi-automatic replay recovers data by utilizing the logged messages to enable officials at Bolagsverket to manually redo lost work in a controlled manner. Proof-of-concept implementations of the two replay approaches are developed on a simplified model that resembles one of Bolagsverket’s electronic services, yet that is general to any stateless system that communicates asynchronously using JMS messages and synchronously using XML sent over HTTP. The theoretical- and performance evaluation was conducted with the aim of producing results general to any system with similar characteristics to those of the model. The results suggest that the failure-free overhead at Bolagsverket is approximately 100 milliseconds per logged message, and that around 3 gigabytes of data must be stored in order to recover one average day’s operation. Further, automatic replay successfully manages to recover one average day’s operation in around 70 minutes. Semi-automatic replay is calculated to require, at a maximum, one workday to recover the same amount of data. It is assessed that automatic replay is a suitable solution for Bolagsverket if it is proven that their systems are fully deterministic. In other cases, it is assessed that semi-automatic replay can be utilized. It is however recommended that further evaluations are conducted before the approaches are implemented in a production environment. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-20901application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Fault tolerance
Rollback-recovery
Data loss
Database
Bolagsverket
Computer and Information Sciences
Data- och informationsvetenskap
Software Engineering
Programvaruteknik
Computer Engineering
Datorteknik
spellingShingle Fault tolerance
Rollback-recovery
Data loss
Database
Bolagsverket
Computer and Information Sciences
Data- och informationsvetenskap
Software Engineering
Programvaruteknik
Computer Engineering
Datorteknik
Sjölinder, Max
Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket
description This thesis investigates two alternative approaches, referred to as automatic- and semi-automatic replay, which can be used to prevent data loss due to a certain set of unforeseen events at Bolagsverket, the Swedish Companies Registration Office. The approaches make it possible to recover the correct data from a database that belongs to a stateless distributed system and that contains erroneous- or inaccurate information due to past faults. Both approaches utilize log-based rollback-recovery techniques but make different assumptions regarding the deterministic behaviour of Bolagsverket’s systems. A stateless distributed system logs all received messages during failure-free operation. During recovery, automatic replay recovers the data by enabling the system to re-process the logged messages. In contrast, semi-automatic replay recovers data by utilizing the logged messages to enable officials at Bolagsverket to manually redo lost work in a controlled manner. Proof-of-concept implementations of the two replay approaches are developed on a simplified model that resembles one of Bolagsverket’s electronic services, yet that is general to any stateless system that communicates asynchronously using JMS messages and synchronously using XML sent over HTTP. The theoretical- and performance evaluation was conducted with the aim of producing results general to any system with similar characteristics to those of the model. The results suggest that the failure-free overhead at Bolagsverket is approximately 100 milliseconds per logged message, and that around 3 gigabytes of data must be stored in order to recover one average day’s operation. Further, automatic replay successfully manages to recover one average day’s operation in around 70 minutes. Semi-automatic replay is calculated to require, at a maximum, one workday to recover the same amount of data. It is assessed that automatic replay is a suitable solution for Bolagsverket if it is proven that their systems are fully deterministic. In other cases, it is assessed that semi-automatic replay can be utilized. It is however recommended that further evaluations are conducted before the approaches are implemented in a production environment.
author Sjölinder, Max
author_facet Sjölinder, Max
author_sort Sjölinder, Max
title Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket
title_short Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket
title_full Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket
title_fullStr Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket
title_full_unstemmed Preventing data loss using rollback-recovery : A proof-of-concept study at Bolagsverket
title_sort preventing data loss using rollback-recovery : a proof-of-concept study at bolagsverket
publisher Mittuniversitetet, Avdelningen för informations- och kommunikationssystem
publishDate 2013
url http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-20901
work_keys_str_mv AT sjolindermax preventingdatalossusingrollbackrecoveryaproofofconceptstudyatbolagsverket
_version_ 1718606526067769344