Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement
The Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser known writers and a goal of expanding research in this field. Using a custom metadata schema with an emphasis on race-related element...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Code4Lib
2019-11-01
|
Series: | Code4Lib Journal |
Online Access: | https://journal.code4lib.org/articles/14834 |
id |
doaj-cba1d96b94e9438eb17567e849393837 |
---|---|
record_format |
Article |
spelling |
doaj-cba1d96b94e9438eb17567e8493938372020-11-25T03:54:20ZengCode4LibCode4Lib Journal1940-57582019-11-014614834Natural Language Processing in the Humanities: A Case Study in Automated Metadata EnhancementErin WolfeThe Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser known writers and a goal of expanding research in this field. Using a custom metadata schema with an emphasis on race-related elements, each novel is analyzed for a variety of elements such as literary style, targeted content analysis, historical context, and other areas. Librarians at KU have worked to develop a variety of computational text analysis processes designed to assist with specific aspects of this metadata collection, including text mining and natural language processing, automated subject extraction based on word sense disambiguation, harvesting data from Wikidata, and other actions.https://journal.code4lib.org/articles/14834 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Erin Wolfe |
spellingShingle |
Erin Wolfe Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement Code4Lib Journal |
author_facet |
Erin Wolfe |
author_sort |
Erin Wolfe |
title |
Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement |
title_short |
Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement |
title_full |
Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement |
title_fullStr |
Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement |
title_full_unstemmed |
Natural Language Processing in the Humanities: A Case Study in Automated Metadata Enhancement |
title_sort |
natural language processing in the humanities: a case study in automated metadata enhancement |
publisher |
Code4Lib |
series |
Code4Lib Journal |
issn |
1940-5758 |
publishDate |
2019-11-01 |
description |
The Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser known writers and a goal of expanding research in this field. Using a custom metadata schema with an emphasis on race-related elements, each novel is analyzed for a variety of elements such as literary style, targeted content analysis, historical context, and other areas. Librarians at KU have worked to develop a variety of computational text analysis processes designed to assist with specific aspects of this metadata collection, including text mining and natural language processing, automated subject extraction based on word sense disambiguation, harvesting data from Wikidata, and other actions. |
url |
https://journal.code4lib.org/articles/14834 |
work_keys_str_mv |
AT erinwolfe naturallanguageprocessinginthehumanitiesacasestudyinautomatedmetadataenhancement |
_version_ |
1724474233435717632 |