Summary: | The adoption of the Semantic Web by life sciences provides exciting opportunities for exploring novel ways to conduct biomedical research. Particularly, the approval of the Web Ontology Language (OWL) by the World Wide Web Consortium (W3C) has provided a global standard for shared representation of biomedical knowledge in the form of ontologies. However, though there are numerous examples of bio-ontologies being used to describe “what is” (i.e., a universal view of reality), there is a dearth of examples where ontologies are used to describe “what might be” (i.e., a hypothetical view of reality).
This thesis proposes that, to achieve scientific rigor, it is important to consider approaches for explicitly representing subjective knowledge in a particular domain - in particular, we examine phenotypic classifications in the clinical domain. We provide supporting evidence, that the OWL is suitable for formally representing these subjective perspectives. Envisioning ontologies as hypothetical, contextual and subjective is a notable departure from the commonly-held view in the life-sciences, where ontologies (ostensibly) represent some "universal truth". We support these arguments with both empirical and quantitative studies. We demonstrate that, when expressed in OWL, many phenotypic classification systems can be accurately modeled in silico. We then demonstrate that these knowledge-models can be "personalized", and show that such models can enable the automated analysis of data in a transparent manner. This results in more rigorous clinical research, while simultaneously allowing the clinicians to maintain their role as the final arbiters of decisions.
Finally, we investigate methodologies that might facilitate the encoding and sharing of personalized expert-knowledge by non-knowledge-engineers - a necessary step in making these ideas useful to the clinical community. The knowledge-acquisition bottleneck is the primary barrier to the widespread use of ontologies in life sciences. Thus, we investigate data-driven methodologies to automatically extract knowledge from existing data-systems, and show that it is possible to "boot-strap" the construction of knowledge models through various data-mining algorithms.
Taken together, these studies begin to reveal a path toward Web-embedded, in silico clinical research, where knowledge is explicit, transparent, personalized, modular, globally-shared, re-used, and dynamically applied to the interpretation and analysis of clinical datasets.
|