Julia Wunderle, M.Sc.
Julia Wunderle
Chair of Data Science (Informatik X)
University of Würzburg
Campus Hubland Nord
Emil-Fischer-Straße 50
97074 Würzburg
Germany
Email: julia.wunderle[at]uni-wuerzburg.de
Phone: (+49 931) 31-80351
Office: Room 50.03.020 (Zentrum für Künstliche Intelligenz und Data Science (CAIDAS))
About Me
I joined the Data Science Chair in December 2024 for my PhD studies after receiving my master's degree in Computer Science at the University of Würzburg.
My current research focuses on advancing the German NLP landscape. As part of this work, we trained both decoder and encoder model families from scratch (LLäMmlein and ModernGBERT) and maintain the first, comprehensive German NLU benchmark, SuperGLEBer. Since end of 2025, I am also involved in the soofi, a Germany-wide effort to develop an open 100B-parameter large language model.
Teaching
- Seminar: Ausgewählte Themen des Machine Learning (SS'25 )
Other
- CAIDAS Best Master Thesis Prize 2025
- Two First-Place and One Second-Place rankings at the GermEval 2025 Shared Task "Harmful Content Detection"
- First Place at SemEval 2024 Task 4 & Overall Best Paper Honorable Mention Award
Publications
-
Die SuperGLEBer at GermEval 2025 Shared Tasks: Growing Pains - When More Isn’t Always Better. . In Proceedings of the 21st Conference on Natural Language Processing (KONVENS 2025): Workshops, C. Wartena, U. Heid (eds.), pp. 479–493. HsH Applied Academics, Hannover, Germany, 2025. -
LLäMmlein: Transparent, Compact and Competitive German-Only Language Models from Scratch. . In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), W. Che, J. Nabende, E. Shutova, M. T. Pilehvar (eds.), pp. 2227–2246. Association for Computational Linguistics, Vienna, Austria, 2025. -
ModernGBERT: German-only 1B Encoder Model Trained from Scratch. . 2025.
-
OtterlyObsessedWithSemantics at SemEval-2024 Task 4: Developing a Hierarchical Multi-Label Classification Head for Large Language Models. . In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), A. K. Ojha, A. S. Dougruöz, H. Tayyar Madabushi, G. Da San Martino, S. Rosenthal, A. Rosá (eds.), pp. 602–612. Association for Computational Linguistics, Mexico City, Mexico, 2024.
-
Pointer Networks: A Unified Approach to Extracting German Opinions. . In Proceedings of the 19th Conference on Natural Language Processing (KONVENS 2023), M. Georges, A. Herygers, A. Friedrich, B. Roth (eds.), pp. 127–138. Association for Computational Lingustics, Ingolstadt, Germany, 2023.