Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality Control

This paper describes batch loading workflows developed for the Knowledge Bank, The Ohio State University’s institutional repository. In the five years since the inception of the repository approximately 80 percent of the items added to the Knowledge Bank, a DSpace repository, have been batch loaded....

Full description

Bibliographic Details
Main Author: Maureen P. Walsh
Format: Article
Language:English
Published: American Library Association 2010-09-01
Series:Information Technology and Libraries
Online Access:https://ejournals.bc.edu/ojs/index.php/ital/article/view/3137
id doaj-910eb7a7ef784bef9142d351089e1e06
record_format Article
spelling doaj-910eb7a7ef784bef9142d351089e1e062020-11-24T23:41:37ZengAmerican Library AssociationInformation Technology and Libraries0730-92952163-52262010-09-0129311712710.6017/ital.v29i3.31372804Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality ControlMaureen P. WalshThis paper describes batch loading workflows developed for the Knowledge Bank, The Ohio State University’s institutional repository. In the five years since the inception of the repository approximately 80 percent of the items added to the Knowledge Bank, a DSpace repository, have been batch loaded. Most of the batch loads utilized Perl scripts to automate the process of importing metadata and content files. Custom Perl scripts were used to migrate data from spreadsheets or comma-separated values files into the DSpace archive directory format, to build collections and tables of contents, and to provide data quality control. Two projects are described to illustrate the process and workflows.https://ejournals.bc.edu/ojs/index.php/ital/article/view/3137
collection DOAJ
language English
format Article
sources DOAJ
author Maureen P. Walsh
spellingShingle Maureen P. Walsh
Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality Control
Information Technology and Libraries
author_facet Maureen P. Walsh
author_sort Maureen P. Walsh
title Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality Control
title_short Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality Control
title_full Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality Control
title_fullStr Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality Control
title_full_unstemmed Batch Loading Collections into DSpace: Using Perl Scripts for Automation and Quality Control
title_sort batch loading collections into dspace: using perl scripts for automation and quality control
publisher American Library Association
series Information Technology and Libraries
issn 0730-9295
2163-5226
publishDate 2010-09-01
description This paper describes batch loading workflows developed for the Knowledge Bank, The Ohio State University’s institutional repository. In the five years since the inception of the repository approximately 80 percent of the items added to the Knowledge Bank, a DSpace repository, have been batch loaded. Most of the batch loads utilized Perl scripts to automate the process of importing metadata and content files. Custom Perl scripts were used to migrate data from spreadsheets or comma-separated values files into the DSpace archive directory format, to build collections and tables of contents, and to provide data quality control. Two projects are described to illustrate the process and workflows.
url https://ejournals.bc.edu/ojs/index.php/ital/article/view/3137
work_keys_str_mv AT maureenpwalsh batchloadingcollectionsintodspaceusingperlscriptsforautomationandqualitycontrol
_version_ 1725506333078192128