Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization

As collections become larger in size, more complex in structure and increasingly diverse in composition, new approaches are needed to help curators assess digital files and make decisions about their long-term preservation. We present research on the use of interactive visualization to analyze file...

Full description

Bibliographic Details
Main Authors: Maria Esteva, Weijia Xu, Suyog Dutt Jain, Jennifer L. Lee, Wendy K. Martin
Format: Article
Language:English
Published: University of Edinburgh 2011-03-01
Series:International Journal of Digital Curation
Online Access:http://www.ijdc.net/index.php/ijdc/article/view/162
id doaj-f447244852084cfab12708b45f590625
record_format Article
spelling doaj-f447244852084cfab12708b45f5906252020-11-24T22:13:53ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562011-03-0161455710.2218/ijdc.v6i1.171154Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with VisualizationMaria EstevaWeijia XuSuyog Dutt JainJennifer L. LeeWendy K. MartinAs collections become larger in size, more complex in structure and increasingly diverse in composition, new approaches are needed to help curators assess digital files and make decisions about their long-term preservation. We present research on the use of interactive visualization to analyze file characterization information for the purpose of assessing the preservation condition of a vast collection of complex electronic records. The case study collection contains over 1,000,000 files of diverse formats arranged in varied record structures and record groups. The visualization application uses tree maps and a relational database management system (RDBMS) to represent the collection's arrangement and to show available characterization information at different levels of aggregation, classification and abstraction. Through this visualization interface curators can interact dynamically with the collections' characterization information to discover trends, as well as compare and contrast various file characteristics across the collection. Curators may select and weight the variables that they want to analyze. They can pursue analysis workflows that go from a high-level overview of the collection's preservation condition based on file format risks, to obtaining more detailed results about the condition of record groups and individual records. While there are various digital preservation planning tools available, to our knowledge none have been designed specifically to visually present assessment information across vast and complex collections. We present research to address the need for such a tool.http://www.ijdc.net/index.php/ijdc/article/view/162
collection DOAJ
language English
format Article
sources DOAJ
author Maria Esteva
Weijia Xu
Suyog Dutt Jain
Jennifer L. Lee
Wendy K. Martin
spellingShingle Maria Esteva
Weijia Xu
Suyog Dutt Jain
Jennifer L. Lee
Wendy K. Martin
Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization
International Journal of Digital Curation
author_facet Maria Esteva
Weijia Xu
Suyog Dutt Jain
Jennifer L. Lee
Wendy K. Martin
author_sort Maria Esteva
title Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization
title_short Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization
title_full Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization
title_fullStr Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization
title_full_unstemmed Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization
title_sort assessing the preservation condition of large and heterogeneous electronic records collections with visualization
publisher University of Edinburgh
series International Journal of Digital Curation
issn 1746-8256
publishDate 2011-03-01
description As collections become larger in size, more complex in structure and increasingly diverse in composition, new approaches are needed to help curators assess digital files and make decisions about their long-term preservation. We present research on the use of interactive visualization to analyze file characterization information for the purpose of assessing the preservation condition of a vast collection of complex electronic records. The case study collection contains over 1,000,000 files of diverse formats arranged in varied record structures and record groups. The visualization application uses tree maps and a relational database management system (RDBMS) to represent the collection's arrangement and to show available characterization information at different levels of aggregation, classification and abstraction. Through this visualization interface curators can interact dynamically with the collections' characterization information to discover trends, as well as compare and contrast various file characteristics across the collection. Curators may select and weight the variables that they want to analyze. They can pursue analysis workflows that go from a high-level overview of the collection's preservation condition based on file format risks, to obtaining more detailed results about the condition of record groups and individual records. While there are various digital preservation planning tools available, to our knowledge none have been designed specifically to visually present assessment information across vast and complex collections. We present research to address the need for such a tool.
url http://www.ijdc.net/index.php/ijdc/article/view/162
work_keys_str_mv AT mariaesteva assessingthepreservationconditionoflargeandheterogeneouselectronicrecordscollectionswithvisualization
AT weijiaxu assessingthepreservationconditionoflargeandheterogeneouselectronicrecordscollectionswithvisualization
AT suyogduttjain assessingthepreservationconditionoflargeandheterogeneouselectronicrecordscollectionswithvisualization
AT jenniferllee assessingthepreservationconditionoflargeandheterogeneouselectronicrecordscollectionswithvisualization
AT wendykmartin assessingthepreservationconditionoflargeandheterogeneouselectronicrecordscollectionswithvisualization
_version_ 1725799490461368320