Combinatorics of Word Morphisms
Final Report Abstract
This project was focused on two main directions. On the one hand, we wanted to develop combinatorial and language theoretic tools allowing us to get a better understanding of the complexity of solving word equations, potentially of restricted form, and to obtain more insights in the structure of the solution-sets for such equations. On the other hand, we wanted to also perform a more general study of the combinatorial and algorithmic properties of word-morphisms, extending beyond the scope of word equations. The main result that we have obtained regarding word equations is the following. Starting from the fact that for quadratic word equations there exists an algorithm based on rewriting rules which generates a directed graph describing all solutions to the equation, we were able to show that, in the case of regular word equations – those for which each variable occurs at most once on each side of the equation –, this graph has nice combinatorial properties. In particular, we were able to show polynomial bounds on its diameter, size, and DAG-width, as well as to obtain some deeper insights into symmetries in its structure. As a consequence, we have obtained a combinatorial proof that the problem of deciding whether a regular word equation has a solution is in NP. This breakthrough result concluded a longer line of research on regular and quadratic word equations, and exhibits one of the largest classes of word equations whose satisfiability is in NP. These results were complemented by a series of investigations, of both theoretical and practical nature, regarding the decidability of other classes of string constraints (that is, word equations with additional linear length constraints and language membership constraints). In the area of the study of combinatorial and algorithmic properties of word-morphisms, our main results come from the investigation of morphisms mapping words to their set of subsequences of a given length. In particular, we have solved a long standing open problem, stated by Imre Simon in the 1970s, by showing that one can compute in linear time the largest number k such that two input words have exactly the same set of subsequences of length k. This result came as the combination of some deep combinatorial observations with the construction of novel efficient data structures. Moreover, it opened a fruitful line of research regarding combinatorial pattern matching and analysis problems for the subsequence-sets of strings.
Publications
-
Detecting One-Variable Patterns. Lecture Notes in Computer Science, 254-270. Springer International Publishing.
Kosolobov, Dmitry; Manea, Florin & Nowotka, Dirk
-
Local patterns. In Proc. FSTTCS 2017, volume 93 of LIPIcs, pages 24:1–24:14
J. D. Day, P. Fleischmann, F. Manea & D. Nowotka
-
The hardness of solving simple word equations. In Proc. MFCS 2017, volume 83 of LIPIcs, pages 18:1–18:14
J. D. Day, F. Manea & D. Nowotka
-
On Matching Generalised Repetitive Patterns. Lecture Notes in Computer Science, 269-281. Springer International Publishing.
Day, Joel D.; Fleischmann, Pamela; Manea, Florin; Nowotka, Dirk & Schmid, Markus L.
-
On the Complexity of Solving Restricted Word Equations. International Journal of Foundations of Computer Science, 29(05), 893-909.
Manea, Florin; Nowotka, Dirk & Schmid, Markus L.
-
The Satisfiability of Word Equations: Decidable and Undecidable Theories. Lecture Notes in Computer Science, 15-29. Springer International Publishing.
Day, Joel D.; Ganesh, Vijay; He, Paul; Manea, Florin & Nowotka, Dirk
-
Unary patterns under permutations. Theoretical Computer Science, 743, 72-82.
Currie, James; Manea, Florin; Nowotka, Dirk & Reshadi, Kamellia
-
Graph and string parameters: Connections between pathwidth, cutwidth and the locality number. In Proc. ICALP 2019, volume 132 of LIPIcs, pages 109:1–109:16.
K. Casel, J. D. Day, P. Fleischmann, T. Kociumaka, F. Manea & M. L. Schmid
-
Hide and seek with repetitions. Journal of Computer and System Sciences, 101, 42-67.
Gawrychowski, Paweł; Manea, Florin; Mercaş, Robert & Nowotka, Dirk
-
Matching Patterns with Variables. Lecture Notes in Computer Science, 1-27. Springer International Publishing.
Manea, Florin & Schmid, Markus L.
-
On Solving Word Equations Using SAT. Lecture Notes in Computer Science, 93-106. Springer International Publishing.
Day, Joel D.; Ehlers, Thorsten; Kulczynski, Mitja; Manea, Florin; Nowotka, Dirk & Poulsen, Danny Bøgsted
-
Repetition avoidance in products of factors. Theoretical Computer Science, 791, 123-126.
Fleischmann, Pamela; Ochem, Pascal & Reshadi, Kamellia
-
Rollercoasters: Long Sequences without Short Runs. SIAM Journal on Discrete Mathematics, 33(2), 845-861.
Biedl, Therese; Biniaz, Ahmad; Cummings, Robert; Lubiw, Anna; Manea, Florin; Nowotka, Dirk & Shallit, Jeffrey
-
Theoretical and Practical Aspects Related to the Avoidability of Patterns in Words. PhD thesis, University of Kiel, Germany, 2019
K. Reshadi
-
Upper bounds on the length of minimal solutions to certain quadratic word equations. In Proc. MFCS 2019, volume 138 of LIPIcs, pages 44:1–44:15.
J. D. Day, F. Manea & Dirk Nowotka
-
Equations enforcing repetitions under permutations. Discrete Applied Mathematics, 285, 61-78.
Day, Joel D.; Fleischmann, Pamela; Manea, Florin & Nowotka, Dirk
-
On the structure of solution sets to regular word equations. In Proc. ICALP 2020, volume 168 of LIPIcs, pages 124:1–124:16.
J. D. Day & F. Manea
-
Pattern Matching with Variables. ACM Transactions on Computation Theory, 12(1), 1-37.
Fernau, Henning; Manea, Florin; Mercaş, Robert & Schmid, Markus L.
-
Rule-based Word Equation Solving. Proceedings of the 8th International Conference on Formal Methods in Software Engineering, 87-97. ACM.
Day, Joel D.; Kulczynski, Mitja; Manea, Florin; Nowotka, Dirk & Poulsen, Danny Bøgsted
-
Scattered Factor-Universality of Words. Lecture Notes in Computer Science, 14-28. Springer International Publishing.
Barker, Laura; Fleischmann, Pamela; Harwardt, Katharina; Manea, Florin & Nowotka, Dirk
-
An SMT Solver for Regular Expressions and Linear Arithmetic over String Length. Lecture Notes in Computer Science, 289-312. Springer International Publishing.
Berzish, Murphy; Kulczynski, Mitja; Mora, Federico; Manea, Florin; Day, Joel D.; Nowotka, Dirk & Ganesh, Vijay
-
Efficiently testing simon’s congruence. In Proc. STACS 2021, volume 187 of LIPIcs, pages 34:1–34:18
P. Gawrychowski, M. Kosche, T. Koß, F. Manea & S. Siemer
-
Matching patterns with variables under hamming distance. In Proc. MFCS 2021, volume 202 of LIPIcs, pages 48:1–48:24.
P. Gawrychowski, F. Manea & S. Siemer
-
The edit distance to k-subsequence universality. In Proc. STACS 2021, volume 187 of LIPIcs, pages 25:1–25:19.
J. D. Day, P. Fleischmann, M. Kosche, T. Koß, F. Manea & S. Siemer
-
ZaligVinder: A generic test framework for string solvers. Journal of Software: Evolution and Process, 35(4).
Kulczynski, Mitja; Manea, Florin; Nowotka, Dirk & Poulsen, Danny Bøgsted
-
Fast and Longest Rollercoasters. Algorithmica, 84(4), 1081-1106.
Gawrychowski, Paweł; Manea, Florin & Serafin, Radosław
-
Matching Patterns with Variables Under Edit Distance. Lecture Notes in Computer Science, 275-289. Springer International Publishing.
Gawrychowski, Paweł; Manea, Florin & Siemer, Stefan
-
Subsequences in Bounded Ranges: Matching and Analysis Problems. Lecture Notes in Computer Science, 140-159. Springer International Publishing.
Kosche, Maria; Koß, Tore; Manea, Florin & Pak, Viktoriya
-
Subsequences with gap constraints: Complexity bounds for matching and analysis problems. In Proc. ISAAC 2022, volume 248 of LIPIcs, pages 64:1–64:18.
J. D. Day, M. Kosche, F. Manea & M. L. Schmid
-
Absent Subsequences in Words. Fundamenta Informaticae, 189(3-4), 199-240.
Kosche, Maria; Koß, Tore; Manea, Florin & Siemer, Stefan
-
k-universality of regular languages. In Proc. ISAAC 2023, volume 283 of LIPIcs, pages 4:1–4:21
D. Adamson, P. Fleischmann, A. Huch, T. Koß, F. Manea & D. Nowotka
-
Longest Common Subsequence with Gap Constraints. Lecture Notes in Computer Science, 60-76. Springer Nature Switzerland.
Adamson, Duncan; Kosche, Maria; Koß, Tore; Manea, Florin & Siemer, Stefan
-
Matching Patterns with Variables Under Simon’s Congruence. Lecture Notes in Computer Science, 155-170. Springer Nature Switzerland.
Fleischmann, Pamela; Kim, Sungmin; Koß, Tore; Manea, Florin; Nowotka, Dirk; Siemer, Stefan & Wiedenhöft, Max
-
On the Expressive Power of String Constraints. Proceedings of the ACM on Programming Languages, 7(POPL), 278-308.
Day, Joel D.; Ganesh, Vijay; Grewal, Nathan & Manea, Florin
-
Towards more efficient methods for solving regular-expression heavy string constraints. Theoretical Computer Science, 943, 50-72.
Berzish, Murphy; Day, Joel D.; Ganesh, Vijay; Kulczynski, Mitja; Manea, Florin; Mora, Federico & Nowotka, Dirk
