Project Details
The DNA from a Coding Perspektive
Subject Area
Electronic Semiconductors, Components and Circuits, Integrated Systems, Sensor Technology, Theoretical Electrical Engineering
Term
from 2012 to 2015
Project identifier
Deutsche Forschungsgemeinschaft (DFG) - Project number 214186146
So far, we have found interesting relationships between mutation probabilities of codons and chemical differences in corresponding amino acids. Up to a few exceptions that need to be studied further, one can conclude that high mutational probabilities between two codons go together with rather relatedamino-acids, hence less damage. Within the coding regions, we did not find any error-correction procedure apart from the known mapping of codons to amino acids. Mutation probabilities also lead to a communication channel description and the capacity is compared to the minimum required information content given by the 20 amino acids.We modified all Lempel-Ziv source coding algorithms following the biological example of "embedded proteins", i.e., we modified them to be bi-directional making use of symmetries in the data.Using Next Generation Sequencing we found that i) the spatial organization of coding units is a determinant of coherent gene expression; ii) the divergent coding units in close proximity show optimal activity at physiological levels of DNA superhelicity; iii) we developed a novel concept of the DNA as a carrier of dual - analog and digital - information; iv) we generated datasets corresponding to patterns of analog information (DNA melting energy and supercoiling) and digital information (distinct functionalclasses of genes) and integrated the physicochemical properties of the genes with their functional content; v) we detected transient chromosomal domains mediating the conversion of the 3D DNA structures into expression patterns; vi) we showed that conversion of the DNA analog information into the digital linear code is a basic device coordinating the gene transcription.Having used multidimensional scaling as a tool to investigate distances between codons and amino-acids, sparse subspace clustering will be studied in the second project phase. However, the interpretation of the obtained results will be essential. Coding properties of promoter regions will be looked into and the comparison between Boltzmann and Shannon entropy is expected to give some more insights. We are keen to know how the physicochemical parameters of the DNA (DNA syntax) are converted into genetic function (DNA semantics). The relationship between Shannon and Boltzmann entropy is seen as the key to obtain insights into protection mechanisms and gene expression.
DFG Programme
Priority Programmes