Project Details
BERT with Character - Knowledge Graph infused neural language models to analyse the depiction of literary characters (LitBERT)
Subject Area
Image and Language Processing, Computer Graphics and Visualisation, Human Computer Interaction, Ubiquitous and Wearable Computing
Term
since 2023
Project identifier
Deutsche Forschungsgemeinschaft (DFG) - Project number 529659926
This project is a collaboration between computer science and computational literary studies (CLS), an emergent field which analyses larger collections of literary texts using a wide set of tools from computational linguistics, computer science and its own tradition. In our project, we focus on the computational literary analysis of character as one of the most important descriptors of narrative and dramatic texts. Our project will investigate the textual description of character's internal and external features, actions and further character specific information using knowledge induced language models. We aim to create a character knowledge graph through extracting character information from text, find different character types through data-driven clustering, and leverage this information to develop a character-attentive, "literary" language model ("LitBERT") for automatic literary analysis. The project will significantly advance the state of the art in the combination of language models and knowledge graphs, showing how to improve the performance of language models for the analysis of entities and their attributes by (a) integrating knowledge graphs and (b) enriching domain specific knowledge graphs based on text analysis using language models. Additionally, we want to improve the handling of longer texts like novels by advancing the capabilities of language models to represent knowledge, like representation and types of characters in the text world (i.e., the world described in the text).
DFG Programme
Research Grants