Detailseite
Projekt Druckansicht

SPP 1527:  Autonomes Lernen

Fachliche Zuordnung Informatik, System- und Elektrotechnik
Biologie
Mathematik
Medizin
Physik
Sozial- und Verhaltenswissenschaften
Förderung Förderung von 2011 bis 2020
Projektkennung Deutsche Forschungsgemeinschaft (DFG) - Projektnummer 172415596
 
Erstellungsjahr 2019

Zusammenfassung der Projektergebnisse

There could not be a more timely topic in AI as Autonomous Learning. During the term of our priority programme AI has developed massively and disruptively, with Deep Learning and Deep RL emerging and now dominating the field. The big-data ML problem of extracting a predictor when enough data is available made enormous progress. But the core scientific challenges defined by our initiative are still pressing: • Let AI be more autonomous about its choice of representation, model, structural priors, hyperparameters, so that it can generalize truly strongly from the available data. • Let AI be more autonomous in collecting and generating its own data. • Realize strongly generalizing AI on real-world autonomous systems, to ensure that AI can also capture, model and exploit the particular structure of the one physical world we live in. The goal of the priority programme was to foster research to tackle these scientific challenges, to support the qualification of young researchers in these research areas, and to support our PIs and young researchers to strengthen their impact and networking with the growing local and international communities working on these research areas. The scientific and structural impact of the second phase of our priority programme was substantial. With an estimated 129 peer-reviewed papers (51 journal, 78 conference), 12 PhD students graduating within the programme, and 6 expected to graduate soon, and several PIs winning full professorships during, the second phase of the programme was highly successful. The SPP greatly supported collaborations, in particular international collaborations based on the NSF-DFG Collaborative Research Programme, and thereby substantially supported our PIs and young researchers to establish themselves in the growing international communities on the topic. Today, Autonomous Learning is not anymore a small niche topic with a small local research network, instead its research agenda became central to modern international AI research. Our PIs and the research agenda of autonomous learning now play a major role in the top conferences in the field.

Projektbezogene Publikationen (Auswahl)

  • Construction of approximation spaces for reinforcement learning. Journal of Machine Learning Research, 2013
    Böhmer, Grünew älder, Shen, Musial, Obermayer
  • Hierarchical System for Word Discovery Exploiting DTW-Based Initialization. In Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2013)
    Walter, Korthals, Haeb-Umbach, Raj
    (Siehe online unter https://doi.org/10.1109/ASRU.2013.6707761)
  • Example-based feedback provision using structured solution spaces. International Journal of Learning Technology, 2014
    Gross, Mokbel, Paaßen, Hammer, Pinkwart
    (Siehe online unter https://doi.org/10.1504/IJLT.2014.065752)
  • A Theory of Cheap Control in Embodied Systems. PLoS Computational Biology (2015) 11(9)
    Montúfar, Ghazi-Zahedi, Ay
    (Siehe online unter https://doi.org/10.1371/journal.pcbi.1004427)
  • Autonomous learning of state representations for control: An emerging field aims to autonomously learn state representations for reinforcement learning agents from their real-world sensor observations. KI - Künstliche Intelligenz, 2015
    Böhmer, Springenberg, Boedecker, Riedmiller, Obermayer
    (Siehe online unter https://doi.org/10.1007/s13218-015-0356-1)
  • Efficient and robust automated machine learning International Conference on Advances in Neural Information Processing Systems, 2015
    Feurer, Klein, Eggensperger, Springenberg, Blum, Hutter
  • Embed to control: A locally linear latent dynamics model for control from raw images. In AdvancesNeural Information Processing Systems (NIPS), 2015
    Watter, Springenberg, Boedecker, Riedmiller
  • Bayesian optimization with robust Bayesian neural networks. International Conference on Advances in Neural Information Processing Systems, 2016
    Springenberg, Klein, Falkner, Hutter
  • Evaluating Morphological Computation in Biomechanical Muscle Models. Frontiers in Robotics and AI (2016)
    Ghazi-Zahedi, Haeufle, Montúfar, Schmitt, Ay
    (Siehe online unter https://doi.org/10.3389/frobt.2016.00042)
  • Deep reinforcement learning with successor features for navigation across similar environments. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017
    Zhang, Springenberg, Boedecker, Burgard
    (Siehe online unter https://doi.org/10.1109/IROS.2017.8206049)
  • Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. In Proc. Interspeech 2017
    Ebbers, Heymann, Drude, Glarner, Haeb-Umbach, Raj
    (Siehe online unter https://doi.org/10.21437/Interspeech.2017-1160)
  • Local Bayesian Optimization of Motor Skills International Conference on Machine Learning (ICML), 2017
    Akrour, Sorokin, Peters, Neumann
  • Opening a lockbox through physical exploration. In Proceedings of the IEEE International Conference on Humanoid Robots (Humanoids), 2017
    Baum, Bernstein, Martín-Martín, Höfer, Kulick, Toussaint, Kacelnik, Brock
    (Siehe online unter https://doi.org/10.1109/HUMANOIDS.2017.8246913)
  • Active tactile exploration based on cost-aware information gain maximization. International Journal of Humanoid Robotics, 2018
    Ottenhaus, Kaul, Vahrenkamp, Asfour
    (Siehe online unter https://doi.org/10.1142/S0219843618500159)
  • Can clustering scale sublinearly with its clusters? A variational EM acceleration of GMMs and k-means. International Conference on Artificial Intelligence and Statistics (AISTATS), 2018
    Forster, Lücke
  • Estimating an articulated tool’s kinematics via visuo-tactile based robotic interactive manipulation. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018
    Li, Uckermann, Haschke, Ritter
    (Siehe online unter https://doi.org/10.1109/IROS.2018.8594295)
  • Learning manipulation skills from a single demonstration. The International Journal of Robotics Research, 2018
    Englert, Toussaint
    (Siehe online unter https://doi.org/10.1177%2F0278364917743795)
  • Model-Free Trajectory-based Policy Optimization with Monotonic Improvement. Journal of Machine Learning Research, 2018
    Akrour, Abdolmaleki, Abdulsamad, Peters, Neumann
    (Siehe online unter https://doi.org/10.15496/publikation-39138)
  • Neural Simpletrons - Learning in the Limit of Few Labels with Directed Generative Networks. Neural Computation, 2018
    Forster, Sheikh, Lücke
    (Siehe online unter https://doi.org/10.1162/neco_a_01100)
  • The continuous hint factory - providing hints in vast and sparsely populated edit distance spaces. Journal of Educational Datamining, 2018
    Paaßen, Hammer, Price, Barnes, Gross, Pinkwart
    (Siehe online unter https://doi.org/10.5281/zenodo.3554698)
 
 

Zusatzinformationen

Textvergrößerung und Kontrastanpassung