Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization
As collections become larger in size, more complex in structure and increasingly diverse in composition, new approaches are needed to help curators assess digital files and make decisions about their long-term preservation. We present research on the use of interactive visualization to analyze file...
Main Authors: | Maria Esteva, Weijia Xu, Suyog Dutt Jain, Jennifer L. Lee, Wendy K. Martin |
---|---|
Format: | Article |
Language: | English |
Published: | University of Edinburgh, 2011-03-01 |
Series: | International Journal of Digital Curation |
Online Access: | http://www.ijdc.net/index.php/ijdc/article/view/162 |
id | doaj-f447244852084cfab12708b45f590625 |
---|---|
record_format | Article |
spelling | doaj-f447244852084cfab12708b45f590625 2020-11-24T22:13:53Z eng University of Edinburgh International Journal of Digital Curation 1746-8256 2011-03-01 6 1 45 57 10.2218/ijdc.v6i1.171 154 Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization Maria Esteva Weijia Xu Suyog Dutt Jain Jennifer L. Lee Wendy K. Martin As collections become larger in size, more complex in structure and increasingly diverse in composition, new approaches are needed to help curators assess digital files and make decisions about their long-term preservation. We present research on the use of interactive visualization to analyze file characterization information for the purpose of assessing the preservation condition of a vast collection of complex electronic records. The case study collection contains over 1,000,000 files of diverse formats arranged in varied record structures and record groups. The visualization application uses tree maps and a relational database management system (RDBMS) to represent the collection's arrangement and to show available characterization information at different levels of aggregation, classification and abstraction. Through this visualization interface curators can interact dynamically with the collections' characterization information to discover trends, as well as compare and contrast various file characteristics across the collection. Curators may select and weight the variables that they want to analyze. They can pursue analysis workflows that go from a high-level overview of the collection's preservation condition based on file format risks, to obtaining more detailed results about the condition of record groups and individual records. While there are various digital preservation planning tools available, to our knowledge none have been designed specifically to visually present assessment information across vast and complex collections. We present research to address the need for such a tool. http://www.ijdc.net/index.php/ijdc/article/view/162 |
collection | DOAJ |
language | English |
format | Article |
sources | DOAJ |
author | Maria Esteva, Weijia Xu, Suyog Dutt Jain, Jennifer L. Lee, Wendy K. Martin |
spellingShingle | Maria Esteva Weijia Xu Suyog Dutt Jain Jennifer L. Lee Wendy K. Martin Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization International Journal of Digital Curation |
author_facet | Maria Esteva, Weijia Xu, Suyog Dutt Jain, Jennifer L. Lee, Wendy K. Martin |
author_sort | Maria Esteva |
title | Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization |
title_sort | assessing the preservation condition of large and heterogeneous electronic records collections with visualization |
publisher | University of Edinburgh |
series | International Journal of Digital Curation |
issn | 1746-8256 |
publishDate | 2011-03-01 |
description | As collections become larger in size, more complex in structure and increasingly diverse in composition, new approaches are needed to help curators assess digital files and make decisions about their long-term preservation. We present research on the use of interactive visualization to analyze file characterization information for the purpose of assessing the preservation condition of a vast collection of complex electronic records. The case study collection contains over 1,000,000 files of diverse formats arranged in varied record structures and record groups. The visualization application uses tree maps and a relational database management system (RDBMS) to represent the collection's arrangement and to show available characterization information at different levels of aggregation, classification and abstraction. Through this visualization interface curators can interact dynamically with the collections' characterization information to discover trends, as well as compare and contrast various file characteristics across the collection. Curators may select and weight the variables that they want to analyze. They can pursue analysis workflows that go from a high-level overview of the collection's preservation condition based on file format risks, to obtaining more detailed results about the condition of record groups and individual records. While there are various digital preservation planning tools available, to our knowledge none have been designed specifically to visually present assessment information across vast and complex collections. We present research to address the need for such a tool. |
url | http://www.ijdc.net/index.php/ijdc/article/view/162 |