Automatic Preparation of ETD Material from the Internet Archive for the DSpace Repository Platform

A big challenge associated with getting an institutional repository off the ground is getting content into it. This article will look at how to use digitization services at the Internet Archive alongside software utilities that the author developed to automate the harvesting of scanned dissertations...

Bibliographic Details
Main Author: Tim Ribaric
Format: Article
Language: English
Published: Code4Lib 2009-11-01
Series: Code4Lib Journal
Online Access: http://journal.code4lib.org/articles/2152
collection DOAJ
publisher Code4Lib
series Code4Lib Journal
issn 1940-5758
publishDate 2009-11-01
description A big challenge associated with getting an institutional repository off the ground is getting content into it. This article will look at how to use digitization services at the Internet Archive alongside software utilities that the author developed to automate the harvesting of scanned dissertations and associated Dublin Core XML files to create an ETD Portal using the DSpace platform. The end result is a metadata-rich, full-text collection of theses that can be constructed for little out of pocket cost.
url http://journal.code4lib.org/articles/2152