Open Topics

21.10.2025 | Biological&Medical Data

Multimodal Integration of scRNA-seq and ATAC-seq Data for Optimizing Latent Representations

This thesis investigates multimodal representation learning for high-dimensional, sparse biological data. The core task is to develop a machine learning model to jointly embed scRNA-seq (gene expression) and ATAC-seq (chromatin… Mehr

06.10.2025 | Natural Language Processing, Misc.

Foundation Large Language Model for HTML

Current large language models (LLMs) have been trained on large corpora of texts and can solve many text-related tasks very well. This work aims to investigate various neural architectures for creating a foundation model for HTML. Mehr

06.10.2025 | Biological&Medical Data

Generating Single Cell RNA-seq Data with Diffusion Models in a Semi-Unsupervised Setting

Great progress has been made recently in the field of single-cell analysis. However, there are still some major hurdles, such as generating high-quality data. On the other hand, diffusion models are on everyone's lips, which would… Mehr

06.10.2025 | Biological&Medical Data

Assessing the Suitability of (Zero-Inflated) Negative Binomial Distributions for scRNA-seq Gene Expression Data

Single-cell RNA sequencing (scRNA-seq) data is highly sparse, with many zero values that pose challenges for statistical modeling. Two suitable distributions to use are the negative binomial (NB) and the zero-inflated negative… Mehr

06.10.2025 | Biological&Medical Data

Using Explainable AI to Uncover Gene Relevance in scRNA-seq Data

Single-cell RNA sequencing (scRNA-seq) provides detailed insights into gene expression at the cellular level. In this thesis, we use explainable AI (XAI) methods to investigate which genes most influence the predictions of a… Mehr

24.06.2025 | Physics & Deep Learning, Ecosystems

Enhancing Regional Climate Projections with Deep Learning

Global climate models play a vital role in predicting future climate conditions, such as temperature or precipitation. However, their coarse spatial resolution (typically 50–100 km) limits their ability to capture regional climate… Mehr

15.05.2025 | Physics & Deep Learning, Ecosystems

Autoregressive Deep Learning Earth System Models

Interested in AI and Climate Science? Simulating the Earth’s climate with Earth System Models (ESMs) is essential for understanding and predicting climate change — past, present, and future. These physics-based simulations are… Mehr

08.05.2025 | Security

LLM-based Intelligent Literature Review Assistant

Develop a semi-automated assistant for structured literature reviews, combining LLMs, citation graph search, and spreadsheet-based curation. This project builds on an existing prototype with basic PDF parsing, citation crawling,… Mehr

06.05.2025 | Security

Elastic Stack Integration for Cyber Security Simulation Logging

Adapt the existing CIDDS simulation environment so that system and network logs are forwarded directly into the Elastic Stack. Evaluate logging tools (e.g., Winlogbeat, Filebeat, Auditbeat) and configure them for simulated… Mehr

06.05.2025 | Security

Automating Attack Simulations with MITRE ATT&CK

Extend an existing simulation environment to execute reproducible cyber-attack scenarios using the MITRE ATT&CK framework and procedures for example from Aomic Red Team. The focus is on automation of attack step execution and… Mehr

06.05.2025 | Security

XAI Benchmarking for Cyber Security

Investigate how to derive explanation-relevant features from simulated SIEM logs by analyzing known attack scenarios. Use publicly available annotated attack logs (e.g., from securitydatasets.com) to identify explanation-relevant… Mehr

06.05.2025 | Security

Domain-Specific Representation-Learning for Cyber Security

Develop and evaluate representation learning methods on structured security log data. The goal is to capture domain-specific patterns from benign and attack behavior that can later support anomaly detection and explainability in… Mehr

30.04.2025 | Security

Role-Based User Behavior Simulation in Enterprise IT Environments

Develop lightweight simulation agents that mimic realistic user behavior (e.g., office staff, developers, admins) in a controlled lab setup. Agents should execute atomic tasks such as browsing, file access, or command usage, with… Mehr

17.02.2025 | Natural Language Processing

Exploring and Evaluating Various Domain Adaptation Techniques

Domain adaptation involves modifying a model, originally trained on general data, to perform effectively in more specialized fields such as medicine, politics, law or instruction following. Mehr

30.01.2025 | Security

Agent-based Machine Learning for Offensive and Defensive Cyber Security

Automatisierte Agenten bieten ein enormes Potenzial für offensive und defensive Cybersecurity. Von der Informationsbeschaffung über Angriffe bis hin zur Erkennung und Abwehr von Angriffen können sie eine Vielzahl komplexer… Mehr

30.01.2025 | Security

Knowledge-based Machine Learning for Proactive Cyber Threat Modeling and Detection Using Ontologies and Knowledge Graphs

Die Erkennung und Modellierung von Cyberangriffspfaden erfordert eine automatisierte Analyse von Bedrohungen. Frameworks wie MITRE ATT&CK oder D3FEND bieten umfangreiche Wissensbasen, werden jedoch oft isoliert genutzt. Mehr

22.01.2025 | Physics & Deep Learning, Ecosystems

Spherical Embeddings or Spherical Fourier Neural Operators

Spherical Harmonics by Inigo.quilez, licensed under CC BY-SA 3.0, via Wikimedia Commons, (https://commons.wikimedia.org/wiki/File:Spherical_Harmonics.png)

Although our planet is a sphere and therefor all weather and climate processes operate on sphere, Deep Learning Weather Models very commonly operate on different rectangular projections of the sphere. This introduces increasingly… Mehr

Theses and Practica

Open Topics

Multimodal Integration of scRNA-seq and ATAC-seq Data for Optimizing Latent Representations

Foundation Large Language Model for HTML

Generating Single Cell RNA-seq Data with Diffusion Models in a Semi-Unsupervised Setting

Assessing the Suitability of (Zero-Inflated) Negative Binomial Distributions for scRNA-seq Gene Expression Data

Using Explainable AI to Uncover Gene Relevance in scRNA-seq Data

Enhancing Regional Climate Projections with Deep Learning

Autoregressive Deep Learning Earth System Models

LLM-based Intelligent Literature Review Assistant

Elastic Stack Integration for Cyber Security Simulation Logging

Automating Attack Simulations with MITRE ATT&CK

XAI Benchmarking for Cyber Security

Domain-Specific Representation-Learning for Cyber Security

Role-Based User Behavior Simulation in Enterprise IT Environments

Exploring and Evaluating Various Domain Adaptation Techniques

Agent-based Machine Learning for Offensive and Defensive Cyber Security

Knowledge-based Machine Learning for Proactive Cyber Threat Modeling and Detection Using Ontologies and Knowledge Graphs

Spherical Embeddings or Spherical Fourier Neural Operators

Bildnachweise