Automation of Flexible Migration Workflows

Many digital preservation scenarios are based on the migration strategy, which itself is heavily tool-dependent. For popular, well-defined and often open file formats – e.g., digital images, such as PNG, GIF, JPEG – a wide range of tools exist. Migration workflows become more difficult with propriet...

Full description

Bibliographic Details
Main Authors: Dirk von Suchodoletz, Klaus Rechert, Randolph Welte, Maurice van den Dobbelsteen, Bill Roberts, Jeffrey van der Hoeven, Jasper Schroder
Format: Article
Language:English
Published: University of Edinburgh 2011-03-01
Series:International Journal of Digital Curation
Online Access:http://www.ijdc.net/index.php/ijdc/article/view/172
id doaj-a99be9a416c74e0da45f96e3c699c597
record_format Article
spelling doaj-a99be9a416c74e0da45f96e3c699c5972020-11-24T23:53:38ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562011-03-016118319810.2218/ijdc.v6i1.181164Automation of Flexible Migration WorkflowsDirk von SuchodoletzKlaus RechertRandolph WelteMaurice van den DobbelsteenBill RobertsJeffrey van der HoevenJasper SchroderMany digital preservation scenarios are based on the migration strategy, which itself is heavily tool-dependent. For popular, well-defined and often open file formats – e.g., digital images, such as PNG, GIF, JPEG – a wide range of tools exist. Migration workflows become more difficult with proprietary formats, as used by the several text processing applications becoming available in the last two decades. If a certain file format can not be rendered with actual software, emulation of the original environment remains a valid option. For instance, with the original Lotus AmiPro or Word Perfect, it is not a problem to save an object of this type in ASCII text or Rich Text Format. In specific environments, it is even possible to send the file to a virtual printer, thereby producing a PDF as a migration output. Such manual migration tasks typically involve human interaction, which may be feasible for a small number of objects, but not for larger batches of files.<br /><br />We propose a novel approach using a software-operated VNC abstraction layer in order to replace humans with machine interaction. Emulators or virtualization tools equipped with a VNC interface are very well suited for this approach. But screen, keyboard and mouse interaction is just part of the setup. Furthermore, digital objects need to be transferred into the original environment in order to be extracted after processing. Nevertheless, the complexity of the new generation of migration services is quickly rising; a preservation workflow is now comprised not only of the migration tool itself, but of a complete software and virtual hardware stack with recorded workflows linked to every supported migration scenario. Thus the requirements of OAIS management must include proper software archiving, emulator selection, system image and recording handling. The concept of view-paths could help either to automatically determine the proper pre-configured virtual environment or to set up system images for certain migration workflows. View-paths may rise in demand, as the generation of PDF output files from Word Perfect input could be cached as pre-fabricated emulator system images. The current groundwork provides several possible optimizations, such as using the automation features of the original environments.<br />http://www.ijdc.net/index.php/ijdc/article/view/172
collection DOAJ
language English
format Article
sources DOAJ
author Dirk von Suchodoletz
Klaus Rechert
Randolph Welte
Maurice van den Dobbelsteen
Bill Roberts
Jeffrey van der Hoeven
Jasper Schroder
spellingShingle Dirk von Suchodoletz
Klaus Rechert
Randolph Welte
Maurice van den Dobbelsteen
Bill Roberts
Jeffrey van der Hoeven
Jasper Schroder
Automation of Flexible Migration Workflows
International Journal of Digital Curation
author_facet Dirk von Suchodoletz
Klaus Rechert
Randolph Welte
Maurice van den Dobbelsteen
Bill Roberts
Jeffrey van der Hoeven
Jasper Schroder
author_sort Dirk von Suchodoletz
title Automation of Flexible Migration Workflows
title_short Automation of Flexible Migration Workflows
title_full Automation of Flexible Migration Workflows
title_fullStr Automation of Flexible Migration Workflows
title_full_unstemmed Automation of Flexible Migration Workflows
title_sort automation of flexible migration workflows
publisher University of Edinburgh
series International Journal of Digital Curation
issn 1746-8256
publishDate 2011-03-01
description Many digital preservation scenarios are based on the migration strategy, which itself is heavily tool-dependent. For popular, well-defined and often open file formats – e.g., digital images, such as PNG, GIF, JPEG – a wide range of tools exist. Migration workflows become more difficult with proprietary formats, as used by the several text processing applications becoming available in the last two decades. If a certain file format can not be rendered with actual software, emulation of the original environment remains a valid option. For instance, with the original Lotus AmiPro or Word Perfect, it is not a problem to save an object of this type in ASCII text or Rich Text Format. In specific environments, it is even possible to send the file to a virtual printer, thereby producing a PDF as a migration output. Such manual migration tasks typically involve human interaction, which may be feasible for a small number of objects, but not for larger batches of files.<br /><br />We propose a novel approach using a software-operated VNC abstraction layer in order to replace humans with machine interaction. Emulators or virtualization tools equipped with a VNC interface are very well suited for this approach. But screen, keyboard and mouse interaction is just part of the setup. Furthermore, digital objects need to be transferred into the original environment in order to be extracted after processing. Nevertheless, the complexity of the new generation of migration services is quickly rising; a preservation workflow is now comprised not only of the migration tool itself, but of a complete software and virtual hardware stack with recorded workflows linked to every supported migration scenario. Thus the requirements of OAIS management must include proper software archiving, emulator selection, system image and recording handling. The concept of view-paths could help either to automatically determine the proper pre-configured virtual environment or to set up system images for certain migration workflows. View-paths may rise in demand, as the generation of PDF output files from Word Perfect input could be cached as pre-fabricated emulator system images. The current groundwork provides several possible optimizations, such as using the automation features of the original environments.<br />
url http://www.ijdc.net/index.php/ijdc/article/view/172
work_keys_str_mv AT dirkvonsuchodoletz automationofflexiblemigrationworkflows
AT klausrechert automationofflexiblemigrationworkflows
AT randolphwelte automationofflexiblemigrationworkflows
AT mauricevandendobbelsteen automationofflexiblemigrationworkflows
AT billroberts automationofflexiblemigrationworkflows
AT jeffreyvanderhoeven automationofflexiblemigrationworkflows
AT jasperschroder automationofflexiblemigrationworkflows
_version_ 1725468711779827712