Project Details
Active learning in the Koopman framework
Applicant
Professor Dr. Karl Worthmann
Subject Area
Automation, Mechatronics, Control Systems, Intelligent Technical Systems, Robotics
Term
since 2025
Project identifier
Deutsche Forschungsgemeinschaft (DFG) - Project number 535860958
This project is part of the research unit "Active Learning for Systems and Control (ALeSCo) - Data Informativity, Uncertainty, and Guarantees". We are going to use the well-established extended dynamic mode decomposition (EDMD) within the Koopman framework to generate data-driven models of dynamical control systems. Then, we leverage the prediction capacity of the derived surrogate models to incorporate active learning in alignment with novel data-informativity notions in model predictive control (MPC). To this end, the assessment of retrieved data is conditioned on the control objective to properly balance exploration and exploitation. The underlying idea of the Koopman framework is to lift a complex, highly-nonlinear dynamical system to render ist dynamics linear. While the associated Koopman operator is linear, it is - in general - infinite dimensional. Here, EDMD yields a finite-dimensional approximation, which can also be extended to control systems. To this end, the dynamics are restricted to a subspace spanned by a finite number of observable functions (projection), before a regression problem is solved to approximately compute the so-called compression (estimation). While EDMD is widely applied in practice, even for extremely-challenging applications like molecular or fluid dynamics, mathematically-sound finite-data error bounds are much more scarce and typically not uniform in the state. Recently, we proposed the first uniform bounds using kernel EDMD. The two key steps were Koopman invariance of the imposed reproducing kernel Hilbert space (RKHS) and the choice of the kernel as radial basis functions with compact support and an adjustable degree of smoothness. Building upon promising results on control-Koopman operator regression (cKor) and our prior work, we will extend these uniform error bounds to control systems and derive, in addition, task-specific bounds, e.g., proportional error estimates for set-point stabilization. Then, we encode prior knowledge and incorporate active-learning in our predictive-control algorithms. For the latter, we make use of information-theoretic concepts like data informativity and develop tailored persistency-of-excitation conditions. Further, we set up a bi-objective optimal control problem to properly assess the exploration/exploitation trade off. Then, goal-oriented adaptive refinements, consistency projectors, and diffusion-based exploration are leveraged, in addition, to foster active learning while taking the geometry of the retrieved data into account. Throughout the project, we apply the developed algorithms to two exemplarily-chosen representative benchmark applications, i.e., robotics and multi-energy systems, to validate the employed information-theoretic measures like data informativity and the inferred active-learning components. Overall, we develop a novel innovative active-learning MPC scheme within the Koopman framework.
DFG Programme
Research Units
