Project Details
Projekt Print View

Grammar Formalisms beyond Context-Free Grammars and their use for Machine Learning Tasks

Subject Area General and Comparative Linguistics, Experimental Linguistics, Typology, Non-European Languages
Term from 2010 to 2021
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 183821958
 
The aim of this project is to develop new models and algorithms forprobabilistic parsing and for statistical machine translation, usingformalisms such as Linear Context-Free Rewriting Systems (LCFRSs),i.e., extensions of Context-Free Grammars that combine aspects ofsynchronous grammars with the capacity to describe discontinuities.Both, parsing and machine translation, face the challenge that theyhave to deal with non-local dependencies, to put it differently, withdiscontinuous units in a sentence that belong together. One way toaccount for these non-local phenomena is to use formalisms that candescribe more than just context-free structures. In the first part ofthe project, this has already been successfully done in parsing, wherewe use LCFRS. We plan to continue this line of research in the secondpart of the project. Concerning syntax-based statistical machinetranslation, we have shown in the first part of the project thatsynchronous LCFRS can describe the sometimes complex alignments oneencounters in parallel corpora. In the second part of the project wetherefore aim at completing the (already started) implementation of astatistical machine translation system based on synchronous LCFRS.
DFG Programme Research Grants
 
 

Additional Information

Textvergrößerung und Kontrastanpassung