Mdmap: A Tool for Metadata Collection and Matching

This paper describes a front-end for the semi-automatic collection, matching, and generation of bibliographic metadata obtained from different sources for use within a digitization architecture. The Library of a Billion Words project is building an infrastructure for digitizing text that requires hi...

Bibliographic Details
Main Author: Rico Simke
Format: Article
Language: English
Published: Code4Lib 2014-10-01
Series: Code4Lib Journal
Online Access: http://journal.code4lib.org/articles/10055
id doaj-13788cdc76624522a674cafccf84a77d
record_format Article
spelling doaj-13788cdc76624522a674cafccf84a77d | 2020-11-25T02:25:15Z | eng | Code4Lib | Code4Lib Journal | 1940-5758 | 2014-10-01 | 26 | 10055 | Mdmap: A Tool for Metadata Collection and Matching | Rico Simke | This paper describes a front-end for the semi-automatic collection, matching, and generation of bibliographic metadata obtained from different sources for use within a digitization architecture. The Library of a Billion Words project is building an infrastructure for digitizing text that requires high-quality bibliographic metadata, but currently only sparse metadata from digitized editions is available. The project’s approach is to collect metadata for each digitized item from as many sources as possible. An expert user can then use an intuitive front-end tool to choose matching metadata. The collected metadata are centrally displayed in an interactive grid view. The user can choose which metadata they want to assign to a certain edition, and export these data as MARCXML. This paper presents a new approach to bibliographic work and metadata correction. We try to achieve a high quality of the metadata by generating a large amount of metadata to choose from, as well as by giving librarians an intuitive tool to manage their data. | http://journal.code4lib.org/articles/10055
collection DOAJ
language English
format Article
sources DOAJ
author Rico Simke
spellingShingle Rico Simke
Mdmap: A Tool for Metadata Collection and Matching
Code4Lib Journal
author_facet Rico Simke
author_sort Rico Simke
title Mdmap: A Tool for Metadata Collection and Matching
title_sort mdmap: a tool for metadata collection and matching
publisher Code4Lib
series Code4Lib Journal
issn 1940-5758
publishDate 2014-10-01
description This paper describes a front-end for the semi-automatic collection, matching, and generation of bibliographic metadata obtained from different sources for use within a digitization architecture. The Library of a Billion Words project is building an infrastructure for digitizing text that requires high-quality bibliographic metadata, but currently only sparse metadata from digitized editions is available. The project’s approach is to collect metadata for each digitized item from as many sources as possible. An expert user can then use an intuitive front-end tool to choose matching metadata. The collected metadata are centrally displayed in an interactive grid view. The user can choose which metadata they want to assign to a certain edition, and export these data as MARCXML. This paper presents a new approach to bibliographic work and metadata correction. We try to achieve a high quality of the metadata by generating a large amount of metadata to choose from, as well as by giving librarians an intuitive tool to manage their data.
url http://journal.code4lib.org/articles/10055
work_keys_str_mv AT ricosimke mdmapatoolformetadatacollectionandmatching
_version_ 1724852269945454592