Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement

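The Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser known writers and a goal of expanding research in this field. Using a custom metadata schema with an emphasis on race-related elements, each novel is analyzed for a variety of elements such as literary style, targeted content analysis, historical context, and other areas. Librarians at KU have worked to develop a variety of computational text analysis processes designed to assist with specific aspects of this metadata collection, including text mining and natural language processing, automated subject extraction based on word sense disambiguation, harvesting data from Wikidata, and other actions.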


Bibliographic Details
Main Author: Erin Wolfe
Format: Article
Language: English
Published: Code4Lib 2019-11-01
Series: Code4Lib Journal
Online Access: https://journal.code4lib.org/articles/14834
id doaj-cba1d96b94e9438eb17567e849393837
record_format Article
spelling doaj-cba1d96b94e9438eb17567e849393837 | 2020-11-25T03:54:20Z | eng | Code4Lib | Code4Lib Journal | 1940-5758 | 2019-11-01 | 46 | 14834 | Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement | Erin Wolfe | The Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser known writers and a goal of expanding research in this field. Using a custom metadata schema with an emphasis on race-related elements, each novel is analyzed for a variety of elements such as literary style, targeted content analysis, historical context, and other areas. Librarians at KU have worked to develop a variety of computational text analysis processes designed to assist with specific aspects of this metadata collection, including text mining and natural language processing, automated subject extraction based on word sense disambiguation, harvesting data from Wikidata, and other actions. | https://journal.code4lib.org/articles/14834
collection DOAJ
language English
format Article
sources DOAJ
author Erin Wolfe
title Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement
publisher Code4Lib
series Code4Lib Journal
issn 1940-5758
publishDate 2019-11-01
description The Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser known writers and a goal of expanding research in this field. Using a custom metadata schema with an emphasis on race-related elements, each novel is analyzed for a variety of elements such as literary style, targeted content analysis, historical context, and other areas. Librarians at KU have worked to develop a variety of computational text analysis processes designed to assist with specific aspects of this metadata collection, including text mining and natural language processing, automated subject extraction based on word sense disambiguation, harvesting data from Wikidata, and other actions.
url https://journal.code4lib.org/articles/14834