Project Details
Projekt Print View

SRPtn: solving the last-mile problem of in-silico reproducibility

Subject Area Bioinformatics and Theoretical Biology
Term since 2026
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 574526432
 
Reproduciblity is a central goal for any scientific data analysis. However, varying computational experience in interdisciplinary teams, time pressure, and technical difficulties can lead to the loss of reproducibility shortly before a manuscript is published or during the review process: while initially researchers might have used reproducibility frameworks like Snakemake, final analysis steps are often still done outside of such systems. We call this the "last mile problem" of data analysis. In our proposal, we suggest developing a graphical analysis platform around Snakemake. The platform shall bridge the gap between computational and non-computational researchers by offering a high level and assistance rich interface for the configuration and execution of Snakemake data analysis workflows. To solve the last mile problem, the platform will enable the extension of such workflows by allowing them to generate new plotting or filtering steps with AI/ML assistance and intuitive user interface elements. Any such extensions will however be automatically be integrated back into the Snakemake data analysis workflow, such that the otherwise common scientific reproducibility loss is avoided.
DFG Programme Research data and software (Scientific Library Services and Information Systems)
 
 

Additional Information

Textvergrößerung und Kontrastanpassung