Project Details

Digitisation / Cataloguing of non-textual objects: eScience-compliant standards for morphology

Subject Area Systematics and Morphology (Zoology)
Term from 2014 to 2019
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 248394582
 
Final Report Year 2019

Final Report Abstract

To achieve our goal of a highly flexible application that fully exploits semantic technology, we deviated substantially from the work plan originally outlined in the proposal (with the approval of the DFG after consultation). Before developing a prototype application for generating Anatomy Knowledge Graphs, we therefore developed the Semantic Programming Ontology (SPrO), with an accompanying Java-based middleware, which we used as a semantic programming language. Resources from SPrO function as commands, attributes, and variables and can be used for describing web-based data-centric applications, with each description forming an ontology in its own right, i.e., the application's source code ontology (SCO). The SCO thus provides the steering logic for the application. The accompanying middleware functions as an interpreter: it treats the descriptions in an SCO as specifications of the application and dynamically executes them. The middleware thereby produces the application and controls its behavior. The Semantic Programming approach provides a development framework that not only seamlessly integrates RDF with HTML but also allows domain experts to develop their own data-centric applications with as little programming experience as possible. With its clear separation of steering logic from interpretation logic, semantic programming follows the idea of separating the main layers of an application, analogous to the separation of interpretation logic and presentation logic. With SPrO and its accompanying middleware, semantic programming provides a basic development framework that supports developers of knowledge graph applications. We have used SPrO to describe a semantic web content management system (S-WCMS), called SOCCOMAS, that stores all data and metadata as semantic knowledge graphs.
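The separation of steering logic from interpretation logic described above can be illustrated with a minimal sketch. It is not the SPrO vocabulary or the Java middleware: every identifier here (the `ex:` terms, `DESCRIPTION`, `HANDLERS`, `interpret`) is a hypothetical name chosen for illustration, and plain Python tuples stand in for RDF triples.

```python
# Hypothetical illustration only: none of these identifiers come from SPrO
# or from the project's middleware.
from typing import Callable, Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# "Steering logic": a declarative description of two GUI elements.
DESCRIPTION: List[Triple] = [
    ("ex:lengthField", "ex:hasCommand", "ex:renderTextInput"),
    ("ex:lengthField", "ex:hasLabel", "Length (mm)"),
    ("ex:saveButton", "ex:hasCommand", "ex:renderButton"),
    ("ex:saveButton", "ex:hasLabel", "Save"),
]


def render_text_input(label: str) -> str:
    return f'<label>{label} <input type="text"></label>'


def render_button(label: str) -> str:
    return f"<button>{label}</button>"


# "Interpretation logic": maps command resources to executable behavior.
HANDLERS: Dict[str, Callable[[str], str]] = {
    "ex:renderTextInput": render_text_input,
    "ex:renderButton": render_button,
}


def interpret(triples: List[Triple]) -> List[str]:
    """Execute each described command and return the resulting HTML."""
    labels = {s: o for s, p, o in triples if p == "ex:hasLabel"}
    html = []
    for subject, predicate, obj in triples:
        if predicate == "ex:hasCommand":
            html.append(HANDLERS[obj](labels.get(subject, "")))
    return html
```

Under these assumptions, adding an entry field means adding two triples to `DESCRIPTION`; `interpret` itself stays unchanged.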
The SCO of SOCCOMAS contains descriptions of ready-to-use features and workflows typically required by data-rich web applications, including user administration, login and user registration, session management, user profiles, publication life-cycle processes (for current draft, backup, recycle bin, deleted draft, current published, and previously published versions), automatic procedures for tracking overall provenance (creator, authors, creation and publication date, contributors, relation between different versions), and procedures for tracking all changes made to a particular data record at the level of individual entry fields. All gathered metadata are recorded as semantic knowledge graphs following established metadata standards. Users of SOCCOMAS do not have to interact directly with data in the form of semantic graphs, because SOCCOMAS makes the data and metadata contained in semantic knowledge graphs accessible through websites. Since every document is published under a Creative Commons license, and since all data and metadata are documented as semantic knowledge graphs that are also accessible through a SPARQL endpoint, all data published by an S-WCMS run by SOCCOMAS meet Tim Berners-Lee's 5-star Linked Open Data principles and comply with the FAIR data principles. Using SOCCOMAS and semantic programming, we have developed a module for morphological descriptions for the morphological data repository Morph·D·Base (MDB), utilizing all of the features described above. Additional features have been added through MDB's own SCO. Semantic Morph·D·Base enables users to generate highly standardized and formalized morphological descriptions that are stored in a tuple store framework as semantic Anatomy Knowledge Graphs. When describing an anatomical structure, users can reference any ontology class from any anatomy ontology that is available at BioPortal and describe the structure and all of its parts as instances of these classes.
Parts can be further described through defined data entry forms. Semantic Morph·D·Base is still in development, but a prototype can be accessed and functions as a proof of concept for SOCCOMAS and our semantic programming approach. Using SOCCOMAS and semantic programming to develop the module for morphological descriptions has proven to save valuable resources and development time. The SCO for the semantic Morph·D·Base prototype was written by a domain expert with knowledge of ontology engineering but no expertise in any programming language. Furthermore, the approach has shown that changes to the organization of the graphical user interface, such as adding a new entry field, can be made on the fly, which facilitates a user-centered design approach to application development. This has greatly reduced the subsequent GUI-optimization workload. All code is available from our GitHub page. The project also resulted in a community effort to propose a common data model for Anatomy Knowledge Graphs, the corresponding publication of which is currently in preparation.
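The idea of describing an anatomical structure and its parts as instances of ontology classes can be sketched as a small set of triples. This is an assumption-laden illustration, not the data model used by Morph·D·Base: the `ex:` instance and class names, the `ex:hasPart` property, and the helper functions are all hypothetical, and plain tuples stand in for RDF triples.

```python
# Hypothetical illustration only: the IRIs, the ex:hasPart property, and
# both helper functions are placeholders, not the Morph·D·Base data model.
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

GRAPH: List[Triple] = [
    # Each described structure is an instance of an ontology class.
    ("ex:specimen1_head", "rdf:type", "ex:HeadClass"),
    ("ex:specimen1_antenna", "rdf:type", "ex:AntennaClass"),
    ("ex:specimen1_flagellum", "rdf:type", "ex:FlagellumClass"),
    # Parthood links the instances into one description.
    ("ex:specimen1_head", "ex:hasPart", "ex:specimen1_antenna"),
    ("ex:specimen1_antenna", "ex:hasPart", "ex:specimen1_flagellum"),
]


def class_of(graph: List[Triple], instance: str) -> Optional[str]:
    """Return the ontology class an instance is typed with, if any."""
    for subject, predicate, obj in graph:
        if subject == instance and predicate == "rdf:type":
            return obj
    return None


def parts_of(graph: List[Triple], root: str) -> List[str]:
    """Collect all direct and indirect parts of a structure."""
    found: List[str] = []
    seen = set()
    stack = [root]
    while stack:
        node = stack.pop()
        for subject, predicate, obj in graph:
            if subject == node and predicate == "ex:hasPart" and obj not in seen:
                seen.add(obj)
                found.append(obj)
                stack.append(obj)
    return found
```

In a real deployment the class IRIs would come from an anatomy ontology and the graph would be queried through the SPARQL endpoint rather than traversed in memory.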
