SuperGLEBer - The first comprehensive German-language benchmark for LLMs
Large Language Models (LLMs) are continuously being developed and improved, and there is no shortage of benchmarks that quantify how well they work; LLM benchmarking is indeed a long-standing practice especially in the NLP research community. However, the majority of these benchmarks are not designed for German-language LLMs. We assembled a broad Natural Language Understanding benchmark suite for the German language and evaluated a wide array of existing German-capable models.
This allows us to comprehensively chart the landscape of German LLMs.
A deep unsupervised Model for Protein Design
The design of new functional proteins can tackle many of the problems humankind is facing today but so far has proven very challenging1. Analogies between protein sequences and human languages have been long noted and a summary of their most prominent similarities is described. Given the tremendous success of Natural Language Processing (NLP) methods in recent years, its application to protein research opens a fresh perspective, shifting from the current energy-function centered paradigm to an unsupervised learning approach based entirely on sequences. To explore this opportunity further we have pre-trained a generative language model on the entire protein sequence space. We find that our language model, ProtGPT2, effectively speaks the
Strong-field Response of complex Systems
The interaction of light with matter covers a large number of physical phenomena that we literally see in our everyday life. Early scientists mostly focused on investigations of electromagnetic radiation in the visible range and at low intensities, where material polarization responds linearly to incident electromagnetic fields. Utilizing the compute clusters at PC2, this project aims at simulating and interpreting the strong-field dynamics of real molecules and larger systems in a rigorous real-space real-time approach including non-linear strong-field effects such as photoionization and high-order harmonic generation of systems ranging from small (chiral) molecules over nano-systems to the condensed phase.
Thermal Evolution and Dynamics of the Interior of Planets and Moons
Over the past decades, large-scale computer simulations have grown to become one of the most powerful approaches to study the interior of Earth-like planets. Geodynamical models are used to investigate the evolution and distribution of the temperature inside the planet that ultimately affects its structure and the way the planet cools over time. Combined with data obtained from planetary missions and laboratory experiments, these models help us to improve our understanding of the history and current state of planets in our Solar System and beyond. These models can teach us about the formation and evolution of planetary environments