Project Details
Projekt Print View

Automated characterization of microbial genomes and metagenomes by collection and verification of association rules

Subject Area Bioinformatics and Theoretical Biology
Term from 2019 to 2023
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 421071204
 
Final Report Year 2024

Final Report Abstract

In the last years, sequencing prokaryotic genomes became increasingly simple and cheap, thanks to the high quality sequencing technologies, such as the platforms offered by Illumina, Nanopore, Pacific Biosciences and other, as well as the powerful bioinformatics software available for the analysis of the results, which allows to easily assemble the sequencing reads and annotate the resulting assembly. For this reason, it is becoming increasingly normal to just sequence and assemble genomes of bacteria and archaea, without this to be followed by a thorough analysis of their contents. An annotation often includes some thousand genes and other features, and manual analyses are time consuming. Furthermore, they are increasingly less rewarding, since just the sequencing and annotation of a prokaryotic genome became too easy to represent a significant scientific advance, and does not by itself justify a publication in a highly ranked journal. This project had the goal to create a system, which can improve the situation, by automatically finding the most interesting parts of a prokaryotic genome, based on expectations about its contents, prior to the genome analysis. Although interest can be subjective, the analysis of scientific literature in the field of genomics shows that such expectations are often expressed in publications and can be collected, logically analysed and applied to new genomes. In this project, a logical framework and a software infrastructure were developed, which aim at providing the basis for collecting from literature and verifying genome content expectations. They were developed as a series of software packages, mainly for the programming language Python, as well as format specifications and ontologies, which, altogether, represent a significant step towards the formalization of this kind of analysis.

Publications

 
 

Additional Information

Textvergrößerung und Kontrastanpassung