Natural Language Processing
In the field of Knowledge-Enriched NLP, we work on current topics in Natural Language Processing. Specifically, we adapt and improve large language models (LLMs) such as BERT and its derivatives. Our particular focus lies in incorporating explicit knowledge, for example from knowledge graphs.
Our application areas range from the analysis of historical literature (where current language models struggle due to the length of the texts) to product reviews, and even to media that are unconventional for NLP, such as comments on http://twitch.tv. Such media present their own challenges due to their distinctive language style. In addition to analyzing pure text, we also investigate how well NLP methods can be adapted to process mathematical equations.
In projects like Kallimachos or CLiGS, we collaborate with literary scholars on both literary and NLP research questions. In MOTIV, we work with psychologists to analyze the interaction between users and smart devices.
Projects
Concluded Projects
-
Kallimachos - Building a complete text analysis pipeline, starting with OCR from paper and going up to high-level text mining.
-
CLiGS - CLiGS combines large text collections with innovative analysis methods and hermeneutic sensibility for context.
Publications
-
Zero-Shot Clickbait Spoiling by Rephrasing Titles as Questions in Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023) (2023). 1090–1095.
-
Point me to your Opinion, SenPoi in Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022) (2022). 1313–1323.
-
The FairyNet Corpus - Character Networks for German Fairy Tales in Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (2021). 49–56.
-
Shared Task on Scene Segmentation @ KONVENS 2021 in Shared Task on Scene Segmentation @ KONVENS 2021 (2021). 1–21.
-
Detecting Scenes in Fiction: A new Segmentation Task in Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers (2021).
-
LM4KG: Improving Common Sense Knowledge Graphs with Language Models in International Semantic Web Conference (2020).
-
HarryMotions – Classifying Relationships in Harry Potter based on Emotion Analysis in 5th SwissText & 16th KONVENS Joint Conference (2020).
-
Emote-Controlled: Obtaining Implicit Viewer Feedback through Emote based Sentiment Analysis on Comments of Popular Twitch.tv Channels in ACM Transactions on Social Computing (2020).
-
Detection of Scenes in Fiction in Proceedings of Digital Humanities 2019 (2019).
-
Analysing Direct Speech in German Novels in DHd 2018 (2018).
-
Burrows Zeta: Varianten und Evaluation in DHd 2018 (2018).
-
Burrows’ Zeta: Exploring and Evaluating Variants and Parameters in DH (2018). 274–277.
-
A White-Box Model for Detecting Author Nationality by Linguistic Differences in Spanish Novels in DH (2018).
-
Towards Sentiment Analysis on German Literature (2017).
-
Straight Talk! Automatic Recognition of Direct Speech in Nineteenth-Century French Novels in DH (2016). 346–353.
-
Analyzing Features for the Detection of Happy Endings in German Novels (2016).
-
Prediction of Happy Endings in German Novels in Proceedings of the Workshop on Interactions between Data Mining and Natural Language Processing 2016, P. Cellier, T. Charnois, A. Hotho, S. Matwin, M.-F. Moens, Y. Toussaint (eds.) (2016). 9–16.
-
Significance Testing for the Classification of Literary Subgenres in DH 2016 (2016).
-
Classification of Literary Subgenres in DHd 2016 (2016).
-
Genre classification on German novels in Proceedings of the 12th International Workshop on Text-based Information Retrieval (2015).