Detailseite
Projekt Druckansicht

Kontinuierliche Qualitätskontrolle von Forschungsdaten zur Sicherung ihrer Reproduzierbarkeit an Hochschulen (CONQUAIRE)

Antragstellerinnen / Antragsteller Professor Dr. Philipp Cimiano; Barbara Knorn
Fachliche Zuordnung Sicherheit und Verlässlichkeit, Betriebs-, Kommunikations- und verteilte Systeme
Theoretische Informatik
Förderung Förderung von 2015 bis 2020
Projektkennung Deutsche Forschungsgemeinschaft (DFG) - Projektnummer 277747081
 
Erstellungsjahr 2019

Zusammenfassung der Projektergebnisse

The Conquaire project has analyzed in detail eight case studies in computational reproducibility involving research groups from areas as varied as computer science / robotics, psychology, linguistics, biology and chemistry. On the basis of accompanying the work of these groups over three years, it has developed a detailed understanding of the variety and heterogeneity of analytical research workflows involved. In each of these case studies, Conquaire has managed to independently reproduce a central result published in one of the papers of the groups involved in the case studies. As a result of the project, the scripts and data for the above mentioned use cases is available in a university-wide Git system. The main obstacles for analytical reproducibility found were i) the lack of documentation and thus reliance on guidance by the original authors, ii) the reliance on some manual steps in the analytical workflow (e.g. clicking on a GUI) , iii) the reliance on non-open and commercial software, and iv) lack of information about which particular version of software and/or data was used to generate a specific results. In terms of infrastructure, Conquaire has developed infrastructure on top of a Git system that allows researchers to commit their data early in the research process into a distributed versioning system, with the benefit of providing a backup service but most importantly versioning the data and making different versions of the data referenceable. The project has also implemented continuous integration principles on top of the Git system, allowing researchers to define tests that their data have to pass as a basis to ensure data quality. It has implemented a badge system that publishes the results of the tests via the Bielefeld University PUB system to create incentives for researchers to make their data consistent and ready to be reused by others. The use of social rewards was an interesting idea to explore, yet it remains to be seen if this sort of incentive-creating mechanisms is accepted by the community of researchers. Overall, the Conquaire project has provided proof-of-concept that analytical reproducibility is indeed feasible and can be effectively supported by an institutional approach and infrastructure that support for scientists to provide their code and data into an institutional repository if not a public repository as a first step to making artifacts referenceable and accessible in line with the FAIR principles.

Projektbezogene Publikationen (Auswahl)

 
 

Zusatzinformationen

Textvergrößerung und Kontrastanpassung