Methods for Extracting Relational Data from Unstructured Texts Prior to Network Visualization in Humanities Research

Network modelling methodologies in the digital humanities have been be used to advance inquiry in a variety of areas—most commonly those having to do with correspondence, citation, and social media networks. While new technologies have made generating high-quality and even dynamic network visualizat...

Full description

Bibliographic Details
Main Authors: S. Scott Graham, Zoltan P. Majdik, Dave Clark
Format: Article
Language:English
Published: Ubiquity Press 2020-11-01
Series:Journal of Open Humanities Data
Subjects:
Online Access:https://openhumanitiesdata.metajnl.com/articles/21
Description
Summary:Network modelling methodologies in the digital humanities have been be used to advance inquiry in a variety of areas—most commonly those having to do with correspondence, citation, and social media networks. While new technologies have made generating high-quality and even dynamic network visualizations relatively easy, key challenges remain for humanities researchers. Many common objects of humanistic inquiry, such as literary, historiographic, and biographical texts are often not easily transformed into the kinds of data structures necessary for network visualization. The Transparency to Visibility (T2V) Project was initiated to develop new methods and toolkits that can support humanistic researchers who need to extract relationship data from unstructured texts to support network visualization. The T2V team used bioethics accountability statements to pilot and evaluate different methods for extracting relationship data. The resulting machine-learning-enhanced natural language processing (NLP) and metadata-assisted approaches offer promising potential pathways for contemporary digital humanities and future toolkit development.   Funding statement: Funding was provided by a National Endowment for the Humanities Digital Humanities Advancement grant (HAA-261070).
ISSN:2059-481X