Intern
    Data Science Chair

    Deep Representation & Metric Learning

    The research area of representation learning focuses on developing algorithms and techniques that enable automatic extraction of meaningful and informative representations or features from raw data. This includes exploring various approaches, such as deep learning, unsupervised learning, and generative models, to learn hierarchical, abstract, and useful representations that capture important patterns and structures in the data. Representation learning aims to improve various machine learning tasks, including classification, clustering, and generation, by facilitating better data understanding and generalization.

     

    Publications

    • ModernGBERT: German-only ...
      ModernGBERT: German-only 1B Encoder Model Trained from Scratch. Ehrmanntraut, Anton; Wunderle, Julia; Pfister, Jan; Jannidis, Fotis; Hotho, Andreas. 2025.
    • {O}tterly{O}bsessed{W}ith...
      {O}tterly{O}bsessed{W}ith{S}emantics at {S}em{E}val-2024 Task 4: Developing a Hierarchical Multi-Label Classification Head for Large Language Models. Wunderle, Julia; Schubert, Julian; Cacciatore, Antonella; Zehe, Albin; Pfister, Jan; Hotho, Andreas. In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), A. K. Ojha, A. S. Do{\u{g}}ru{\"o}z, H. Tayyar Madabushi, G. Da San Martino, S. Rosenthal, A. Ros{\’a} (eds.), pp. 602–612. Association for Computational Linguistics, Mexico City, Mexico, 2024.
    • LL\"aMmlein: Compact and ...
      LL\"aMmlein: Compact and Competitive German-Only Language Models from Scratch. Pfister, Jan; Wunderle, Julia; Hotho, Andreas. 2024.
    • BibSonomy Meets ChatLLMs ...
      BibSonomy Meets ChatLLMs for Publication Management: From Chat to Publication Management: Organizing your related work using BibSonomy & LLMs. Völker, Tom; Pfister, Jan; Koopmann, Tobias; Hotho, Andreas. 2024.
    • The {F}airy{N}et Corpus -...
      The {F}airy{N}et Corpus - Character Networks for {G}erman Fairy Tales. Schmidt, David; Zehe, Albin; Lorenzen, Janne; Sergel, Lisa; D{\"u}ker, Sebastian; Krug, Markus; Puppe, Frank. In Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pp. 49–56. Association for Computational Linguistics, Punta Cana, Dominican Republic (online), 2021.
    • Detecting Scenes in Ficti...
      Detecting Scenes in Fiction: A new Segmentation Task. Zehe, Albin; Konle, Leonard; Dümpelmann, Lea; Gius, Evelyn; Hotho, Andreas; Jannidis, Fotis; Kaufmann, Lucas; Krug, Markus; Puppe, Frank; Reiter, Nils; Schreiber, Annekea; Wiedmer, Nathalie. In Proceedings of the 16th Conference of the {E}uropean Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. ACL, 2021.
    • Shared Task on Scene Segm...
      Shared Task on Scene Segmentation @ KONVENS 2021. Zehe, Albin; Konle, Leonard; Guhr, Svenja; Dümpelmann, Lea; Gius, Evelyn; Hotho, Andreas; Jannidis, Fotis; Kaufmann, Lucas; Krug, Markus; Puppe, Frank; Reiter, Nils; Schreiber, Annekea. In Shared Task on Scene Segmentation @ KONVENS 2021, pp. 1–21. 2021.
    • Emote-Controlled: Obtaini...
      Emote-Controlled: Obtaining Implicit Viewer Feedback through Emote based Sentiment Analysis on Comments of Popular Twitch.tv Channels. Kobs, Konstantin; Zehe, Albin; Bernstetter, Armin; Chibane, Julian; Pfister, Jan; Tritscher, Julian; Hotho, Andreas. In ACM Transactions on Social Computing. 2020.
    • HarryMotions – Classify...
      HarryMotions – Classifying Relationships in Harry Potter based on Emotion Analysis. Zehe, Albin; Arns, Julia; Hettinger, Lena; Hotho, Andreas. In 5th SwissText & 16th KONVENS Joint Conference. 2020.
    • LM4KG: Improving Common S...
      LM4KG: Improving Common Sense Knowledge Graphs with Language Models. Omeliyanenko, Janna; Zehe, Albin; Hettinger, Lena; Hotho, Andreas. In International Semantic Web Conference. Springer, 2020.
    • Detection of Scenes in Fi...
      Detection of Scenes in Fiction. Gius, Evelyn; Jannidis, Fotis; Krug, Markus; Zehe, Albin; Hotho, Andreas; Puppe, Frank; Krebs, Jonathan; Reiter, Nils; Wiedmer, Nathalie; Konle, Leonard. In Proceedings of Digital Humanities 2019. 2019.
    • Analysing Direct Speech i...
      Analysing Direct Speech in German Novels. Jannidis, Fotis; Konle, Leonard; Zehe, Albin; Hotho, Andreas; Krug, Markus. In DHd 2018. 2018.
    • Burrows’ Zeta: Exploring and Evaluating Variants and Parameters. Schöch, Christof; Schlör, Daniel; Zehe, Albin; Gebhard, Henning; Becker, Martin; Hotho, Andreas. In DH, pp. 274–277. 2018.
    • A White-Box Model for Det...
      A White-Box Model for Detecting Author Nationality by Linguistic Differences in Spanish Novels. Zehe, Albin; Schlör, Daniel; Henny-Krahmer, Ulrike; Becker, Martin; Hotho, Andreas. In DH. ADHO, 2018.
    • Burrows Zeta: Varianten u...
      Burrows Zeta: Varianten und Evaluation. Schöch, Christof; Calvo, José; Zehe, Albin; Hotho, Andreas. In DHd 2018. 2018.
    • Towards Sentiment Analysi...
      Towards Sentiment Analysis on German Literature. Zehe, Albin; Becker, Martin; Jannidis, Fotis; Hotho, Andreas. 2017.
    • Prediction of Happy Endin...
      Prediction of Happy Endings in German Novels. Zehe, Albin; Becker, Martin; Hettinger, Lena; Hotho, Andreas; Reger, Isabella; Jannidis, Fotis. In Proceedings of the Workshop on Interactions between Data Mining and Natural Language Processing 2016, P. Cellier, T. Charnois, A. Hotho, S. Matwin, M.-F. Moens, Y. Toussaint (eds.), pp. 9–16. 2016.
    • Classification of Literar...
      Classification of Literary Subgenres. Hettinger, Lena; Jannidis, Fotis; Reger, Isabella; Hotho, Andreas. In DHd 2016. 2016.
    • Straight Talk! Automatic ...
      Straight Talk! Automatic Recognition of Direct Speech in Nineteenth-Century French Novels. Schöch, Christof; Schlör, Daniel; Popp, Stefanie; Brunner, Annelen; Henny, Ulrike; Tello, Jos{\’e} Calvo. In DH, pp. 346–353. 2016.
    • Analyzing Features for th...
      Analyzing Features for the Detection of Happy Endings in German Novels. Jannidis, Fotis; Reger, Isabella; Zehe, Albin; Becker, Martin; Hettinger, Lena; Hotho, Andreas. 2016.
    • Straight Talk! Automatic ...
      Straight Talk! Automatic Recognition of Direct Speech in Nineteenth-Century French Novels. Sch{\"o}ch, Christof; Schl{\"o}r, Daniel; Popp, Stefanie; Brunner, Annelen; Henny, Ulrike; Tello, Jos{\’e} Calvo. In DH, pp. 346–353. 2016.
    • Significance Testing for ...
      Significance Testing for the Classification of Literary Subgenres. Hettinger, Lena; Jannidis, Fotis; Reger, Isabella; Hotho, Andreas. In DH 2016. 2016.
    • Genre classification on G...
      Genre classification on German novels. Hettinger, Lena; Becker, Martin; Reger, Isabella; Jannidis, Fotis; Hotho, Andreas. In Proceedings of the 12th International Workshop on Text-based Information Retrieval. 2015.
    • Proceedings of the 1st In...
      Proceedings of the 1st International Workshop on Interactions between Data Mining and Natural Language Processing co-located with The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, DMNLP@PKDD/ECML 2014, Nancy, France, September 15, 2014. Cellier, Peggy; Charnois, Thierry; Hotho, Andreas; Matwin, Stan; Moens, Marie{-}Francine; Toussaint, Yannick. In Vol. 1202 of {CEUR} Workshop Proceedings. CEUR-WS.org, 2014.