Natural Language Processing

Digital Humanities and the application and development of Natural Language Processing methods are active research fields at the Data Science Chair. In projects such as LitBERT, Kallimachos, and CLiGS, we collaborate with literary scholars on both literary and NLP research questions. Current research topics include, for example, the detection of direct speech, the classification of text types, and sentiment analysis in a literary context.

The following staff members have open topics for practica and for bachelor's and master's theses:

Natural Language Processing (for Novels), Digital Humanities	Albin Zehe
Knowledge Graphs, Unstructured Knowledge Representations	Janna Omeliyanenko
NLP, Aspect-based Sentiment Analysis, Pointer Networks	Jan Pfister

In the case of excellent performance, there is also the chance to submit the thesis as an article to a computer science conference and to be a co-author of a scientific publication early in your studies!

Open Topics

10/06/2025 | Natural Language Processing, Misc.

Foundation Large Language Model for HTML

Current large language models (LLMs) have been trained on large corpora of texts and can solve many text-related tasks very well. This work aims to investigate various neural architectures for creating a foundation model for HTML.

02/17/2025 | Natural Language Processing

Exploring and Evaluating Various Domain Adaptation Techniques

Domain adaptation involves modifying a model, originally trained on general data, to perform effectively in more specialized fields such as medicine, politics, law or instruction following.
