Mdmap: A Tool for Metadata Collection and Matching

This paper describes a front-end for the semi-automatic collection, matching, and generation of bibliographic metadata obtained from different sources for use within a digitization architecture. The Library of a Billion Words project is building an infrastructure for digitizing text that requires hi...

Full description

Bibliographic Details
Main Author:	Rico Simke
Format:	Article
Language:	English
Published:	Code4Lib 2014-10-01
Series:	Code4Lib Journal
Online Access:	http://journal.code4lib.org/articles/10055

id	doaj-13788cdc76624522a674cafccf84a77d
record_format	Article
spelling	doaj-13788cdc76624522a674cafccf84a77d2020-11-25T02:25:15ZengCode4LibCode4Lib Journal1940-57582014-10-012610055Mdmap: A Tool for Metadata Collection and MatchingRico SimkeThis paper describes a front-end for the semi-automatic collection, matching, and generation of bibliographic metadata obtained from different sources for use within a digitization architecture. The Library of a Billion Words project is building an infrastructure for digitizing text that requires high-quality bibliographic metadata, but currently only sparse metadata from digitized editions is available. The project’s approach is to collect metadata for each digitized item from as many sources as possible. An expert user can then use an intuitive front-end tool to choose matching metadata. The collected metadata are centrally displayed in an interactive grid view. The user can choose which metadata they want to assign to a certain edition, and export these data as MARCXML. This paper presents a new approach to bibliographic work and metadata correction. We try to achieve a high quality of the metadata by generating a large amount of metadata to choose from, as well as by giving librarians an intuitive tool to manage their data.http://journal.code4lib.org/articles/10055
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Rico Simke
spellingShingle	Rico Simke Mdmap: A Tool for Metadata Collection and Matching Code4Lib Journal
author_facet	Rico Simke
author_sort	Rico Simke
title	Mdmap: A Tool for Metadata Collection and Matching
title_short	Mdmap: A Tool for Metadata Collection and Matching
title_full	Mdmap: A Tool for Metadata Collection and Matching
title_fullStr	Mdmap: A Tool for Metadata Collection and Matching
title_full_unstemmed	Mdmap: A Tool for Metadata Collection and Matching
title_sort	mdmap: a tool for metadata collection and matching
publisher	Code4Lib
series	Code4Lib Journal
issn	1940-5758
publishDate	2014-10-01
description	This paper describes a front-end for the semi-automatic collection, matching, and generation of bibliographic metadata obtained from different sources for use within a digitization architecture. The Library of a Billion Words project is building an infrastructure for digitizing text that requires high-quality bibliographic metadata, but currently only sparse metadata from digitized editions is available. The project’s approach is to collect metadata for each digitized item from as many sources as possible. An expert user can then use an intuitive front-end tool to choose matching metadata. The collected metadata are centrally displayed in an interactive grid view. The user can choose which metadata they want to assign to a certain edition, and export these data as MARCXML. This paper presents a new approach to bibliographic work and metadata correction. We try to achieve a high quality of the metadata by generating a large amount of metadata to choose from, as well as by giving librarians an intuitive tool to manage their data.
url	http://journal.code4lib.org/articles/10055
work_keys_str_mv	AT ricosimke mdmapatoolformetadatacollectionandmatching
_version_	1724852269945454592

Mdmap: A Tool for Metadata Collection and Matching

Similar Items