
Personalized cancer-specific networks

Subject area Bioinformatics and Theoretical Biology
General Genetics and Functional Genomics
Structural Biology
Cell Biology
Funding period 2017 to 2022
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 326946590
Report year 2024

Summary of project results

Bioinformatics finds biomarkers involved in tumorigenesis. One main goal of the project was to investigate isoform switching, which often translates into the loss or gain of domains that mediate protein interactions and thus rewires the interactome, i.e., the set of all interacting proteins in an organism. We found several probable novel biomarkers involved in tumorigenesis, such as a four-gene signature (ODAM, APP, IKBKG, and TOLLIP) specific to BRAF-like THCA (thyroid carcinoma). The ability to identify uniquely distorted interactions whose participants are predictive of patient survival opens up the possibility of computationally deriving potential protein biomarkers for specific cancer types and subtypes.

Better prediction of interaction sites in membrane proteins. Interactions between transmembrane (TM) proteins are fundamental to a wide spectrum of cellular functions. Across the main functional families of human transmembrane proteins, the percentage of amino acid sites predicted to be involved in interactions typically ranges between 10% and 25%, and reaches up to 30% in ion channels. We developed DeepTMInter, which uses deep learning to improve the sequence-based prediction of interaction sites in helical transmembrane proteins.

Family-specific analysis of variant pathogenicity prediction tools. Many methods predict the effect of single amino acid variants (SAVs) on molecular protein function. Correlating such predictions with pathogenicity, we found that each tool has its Achilles heel, i.e., cases in which its predictions are unreliable. Protein families more sensitive to changes were often involved in, e.g., regulating transcription, while those in which mutations did not lead to disease were often related to immune and stress responses. These results suggest that taking functional annotation into account could improve effect predictions.

Identification of disease-causing mutations in human transmembrane proteins. We developed Pred-MutHTP, a suite of sequence-based computational methods for discriminating between disease-causing and neutral SAVs, which advanced the state of the art in predicting the effects of variants in transmembrane protein segments.

Protein language models (pLMs) change protein bioinformatics. AlphaFold2 has revolutionized the life sciences by reliably predicting protein 3D structure. Orthogonal to its success, which builds on multiple sequence alignments (MSAs), we have developed so-called protein language models (pLMs) that bypass the increasingly expensive database search needed to generate MSAs. Instead, pLMs extract information from single protein sequences. Crucial to their success is their ability to learn directly from large amounts of unlabeled protein sequence data. After an initial pre-training phase, the information learned can be transferred to downstream prediction tasks by using representations extracted from the pLM (referred to as embeddings) as input. We used embeddings to predict sequence conservation and the effects of SAVs without the need for MSAs. Using pLM embeddings as input also reduced the amount of data needed to train subsequent prediction methods. This improved performance for various tasks relevant to understanding the protein networks underlying most diseases, namely, we successfully predicted various ligand-binding classes, cellular location, disorder, and gene ontology. All methods and data sets are freely available online. As announced in our proposal, we have made predictions from our new pLM-based methods easily accessible via the new LambdaPP web server. The unexpected and vast potential of pLMs clearly constitutes a major game changer in protein bioinformatics, as attested by the number of citations to several of the papers describing pLMs.
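
The transfer step described for pLM embeddings (per-residue representations used as input to a downstream predictor) can be illustrated with a small sketch. This is a toy illustration under stated assumptions, not the project's implementation: `embed_residues` is a hypothetical stand-in for a pre-trained pLM and returns a fixed two-dimensional vector per residue, whereas a real pLM returns learned, context-dependent vectors. The training sequences, the labels "membrane" and "soluble", and the nearest-centroid classifier are likewise made up for illustration.

```python
import math
from typing import Dict, List, Sequence

# ASSUMPTION: these residue groups replace a real pLM purely for illustration.
HYDROPHOBIC = set("AVILMFWC")
CHARGED = set("DEKR")


def embed_residues(sequence: str) -> List[List[float]]:
    """Hypothetical pLM stand-in: one 2-d vector per residue."""
    return [
        [1.0 if aa in HYDROPHOBIC else 0.0, 1.0 if aa in CHARGED else 0.0]
        for aa in sequence
    ]


def embed_protein(sequence: str) -> List[float]:
    """Mean-pool per-residue vectors into one fixed-length per-protein vector."""
    per_residue = embed_residues(sequence)
    return [sum(column) / len(per_residue) for column in zip(*per_residue)]


def fit_centroids(
    sequences: Sequence[str], labels: Sequence[str]
) -> Dict[str, List[float]]:
    """Downstream 'training': average the protein embeddings of each class."""
    grouped: Dict[str, List[List[float]]] = {}
    for seq, label in zip(sequences, labels):
        grouped.setdefault(label, []).append(embed_protein(seq))
    return {
        label: [sum(column) / len(vectors) for column in zip(*vectors)]
        for label, vectors in grouped.items()
    }


def predict(sequence: str, centroids: Dict[str, List[float]]) -> str:
    """Assign the class whose centroid is closest in embedding space."""
    vector = embed_protein(sequence)
    return min(centroids, key=lambda label: math.dist(vector, centroids[label]))


# Made-up toy data; no MSA or database search is involved at any point.
TRAIN_SEQUENCES = ["AVILLFFAVIL", "LLIVAAFWLLM", "DEKRDEKSTN", "KKDEESGRDE"]
TRAIN_LABELS = ["membrane", "membrane", "soluble", "soluble"]
CENTROIDS = fit_centroids(TRAIN_SEQUENCES, TRAIN_LABELS)

if __name__ == "__main__":
    for query in ("IVLLAFAAWL", "DEKKRSDENQ"):
        print(query, predict(query, CENTROIDS))
```

The point of the sketch is the division of labor: the embedding function is fixed after pre-training, and only the small downstream predictor is fitted on labeled data, which is why comparatively few labeled examples can suffice.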

