Automatic Preparation of ETD Material from the Internet Archive for the DSpace Repository Platform

Bibliographic Details
Main Author: Tim Ribaric
Format: Article
Language: English
Published: Code4Lib 2009-11-01
Series: Code4Lib Journal
Online Access: http://journal.code4lib.org/articles/2152
Description
Summary: A big challenge in getting an institutional repository off the ground is getting content into it. This article looks at how to use the digitization services of the Internet Archive, alongside software utilities that the author developed, to automate the harvesting of scanned dissertations and their associated Dublin Core XML files and create an ETD portal on the DSpace platform. The end result is a metadata-rich, full-text collection of theses that can be built at little out-of-pocket cost.
ISSN: 1940-5758