Levels and building blocks—toward a domain granularity framework for the life sciences

Abstract Background With the emergence of high-throughput technologies, Big Data and eScience, the use of online data repositories and the establishment of new data standards that require data to be computer-parsable become increasingly important. As a consequence, there is an increasing need for an...

Full description

Bibliographic Details
Main Author: Lars Vogt
Format: Article
Language:English
Published: BMC 2019-01-01
Series:Journal of Biomedical Semantics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13326-019-0196-2
Description
Summary:Abstract Background With the emergence of high-throughput technologies, Big Data and eScience, the use of online data repositories and the establishment of new data standards that require data to be computer-parsable become increasingly important. As a consequence, there is an increasing need for an integrated system of hierarchies of levels of different types of material entities that helps with organizing, structuring and integrating data from disparate sources to facilitate data exploration, data comparison and analysis. Theories of granularity provide such integrated systems. Results On the basis of formal approaches to theories of granularity authored by information scientists and ontology researchers, I discuss the shortcomings of some applications of the concept of levels and argue that the general theory of granularity proposed by Keet circumvents these problems. I introduce the concept of building blocks, which gives rise to a hierarchy of levels that can be formally characterized by Keet’s theory. This hierarchy functions as an organizational backbone for integrating various other hierarchies that I briefly discuss, resulting in a domain granularity framework for the life sciences. I also discuss the consequences of this granularity framework for the structure of the top-level category of ‘material entity’ in Basic Formal Ontology. Conclusions The domain granularity framework suggested here is meant to provide the basis on which a more comprehensive information framework for the life sciences can be developed, which would provide the much needed conceptual framework for representing domains that cover multiple granularity levels. This framework can be used for intuitively structuring data in the life sciences, facilitating data exploration, and it can be employed for reasoning over different granularity levels across different hierarchies. It would provide a methodological basis for establishing comparability between data sets and for quantitatively measuring their degree of semantic similarity.
ISSN:2041-1480