Project Details
Explanations for healthy distrust in large language models (C01)
Subject Area
General, Cognitive and Mathematical Psychology
Methods in Artificial Intelligence and Machine Learning
Methods in Artificial Intelligence and Machine Learning
Term
since 2021
Project identifier
Deutsche Forschungsgemeinschaft (DFG) - Project number 438445824
Since ML models have limitations, human ability to question and distrust their decisions is crucial for human-AI-interaction. C01 established a common terminology for distrust, demonstrated that distrust is not easily fostered, and developed novel machine learning algorithms to identify and explain model uncertainty. We will now develop interventions to foster healthy distrust in the domain of academic writing with LLM support with a novel type of perplexing explanations. The TRR will hereby be provided with a tool to automatically generate explanations that support human agency.
DFG Programme
CRC/Transregios
Subproject of
TRR 318:
Constructing explainability
Applicant Institution
Universität Paderborn
Project Heads
Professorin Dr. Barbara Hammer; Professor Dr. Benjamin Paaßen, since 1/2026; Professorin Dr. Ingrid Scharlau
