Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality Control

Bibliographic Details
Main Author: Maureen P. Walsh
Format: Article
Language: English
Published: American Library Association 2010-09-01
Series: Information Technology and Libraries
Online Access: https://ejournals.bc.edu/ojs/index.php/ital/article/view/3137
Description
Summary: This paper describes batch loading workflows developed for the Knowledge Bank, The Ohio State University’s institutional repository. In the five years since the inception of the repository, approximately 80 percent of the items added to the Knowledge Bank, a DSpace repository, have been batch loaded. Most of the batch loads used Perl scripts to automate the import of metadata and content files. Custom Perl scripts were used to migrate data from spreadsheets or comma-separated values files into the DSpace archive directory format, to build collections and tables of contents, and to provide data quality control. Two projects are described to illustrate the process and workflows.
ISSN: 0730-9295, 2163-5226
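
Illustration: The migration step named in the summary, turning rows of a comma-separated values file into the DSpace archive directory format, can be sketched roughly as follows. This is not the author's code: the paper's scripts were written in Perl, and Python is used here only for illustration. The column names, the `FIELD_MAP` mapping, and the functions `build_dublin_core` and `csv_to_archive` are all hypothetical. The sketch assumes the common DSpace layout of one directory per item holding a `dublin_core.xml` file and a `contents` file, and it includes one simple quality-control check (rows without a title or filename are reported instead of written).

```python
import csv
import io
import os
import xml.etree.ElementTree as ET

# Hypothetical mapping from CSV column names to Dublin Core (element, qualifier).
FIELD_MAP = {
    "title": ("title", "none"),
    "author": ("contributor", "author"),
    "date": ("date", "issued"),
}


def build_dublin_core(row, field_map=FIELD_MAP):
    """Return the dublin_core.xml text for one CSV row, skipping empty values."""
    root = ET.Element("dublin_core")
    for column, (element, qualifier) in field_map.items():
        value = (row.get(column) or "").strip()
        if value:
            node = ET.SubElement(root, "dcvalue", element=element, qualifier=qualifier)
            node.text = value
    return ET.tostring(root, encoding="unicode")


def csv_to_archive(csv_text, out_dir, field_map=FIELD_MAP):
    """Write one item directory per valid row; return (items written, problems)."""
    problems = []
    written = 0
    # The header is line 1, so data rows start at line 2
    # (approximate if a field contains embedded newlines).
    for line_no, row in enumerate(csv.DictReader(io.StringIO(csv_text)), start=2):
        title = (row.get("title") or "").strip()
        filename = (row.get("filename") or "").strip()
        # Quality control: report incomplete rows rather than loading them.
        if not title or not filename:
            problems.append(f"line {line_no}: missing title or filename")
            continue
        written += 1
        item_dir = os.path.join(out_dir, f"item_{written:03d}")
        os.makedirs(item_dir, exist_ok=True)
        with open(os.path.join(item_dir, "dublin_core.xml"), "w", encoding="utf-8") as fh:
            fh.write(build_dublin_core(row, field_map))
        # The contents file lists the item's content files, one per line.
        # Copying the files themselves into item_dir is omitted from this sketch.
        with open(os.path.join(item_dir, "contents"), "w", encoding="utf-8") as fh:
            fh.write(filename + "\n")
    return written, problems
```

Calling `csv_to_archive(text, "archive")` on a CSV with the columns `title,author,date,filename` would produce `archive/item_001`, `archive/item_002`, and so on, plus a list of rows that failed the check. A real batch load would also need to copy each content file into its item directory before running the DSpace importer.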