Projects

Things I've built

A mix of research, side projects, and experiments. Some are polished, some are just me chasing a question.

Twenty Questions, Interpreted

2026

A mechanistic-interpretability study of whether an LLM truly commits to a secret in 20 Questions, using linear probes, activation patching, steering, and sparse autoencoders on Gemma-3.

interpretabilityLLMsresearch

Two Heads or One? Multi-Agent LLM Reasoning

2025

Bachelor's thesis (UZH). Tests whether gains in multi-agent LLM reasoning come from genuinely separate model instances or just role-based perspective diversity. It compares two DeepSeek-V3 instances against a single model alternating roles, across Debate / Cooperative / Teacher-Student strategies on AIME, GPQA Diamond, and LiveBench Reasoning. Model separation helped most in critique-oriented dialogue; cooperative settings didn't require true independence.

LLMsmulti-agentreasoningthesis

Lexicon Meets Prosody

2025

Classifies overlapping speech in spontaneous multi-party conversation (AMI Meeting Corpus) as cooperative (e.g. backchannels) or competitive (e.g. interruptions). Combines Wav2Vec audio embeddings with lexical sentence embeddings from noisy ASR, trained via a weakly-supervised labeling pipeline (heuristics + LLM-assisted annotation). Adding lexical features improved performance, though competitive overlaps stayed hard.

speechASRclassification

Logistic Platonic Space

2026

A creative-coding passion project: an interactive simulation of 25,600 coupled chaotic agents based on the logistic map, exploring how locally simple systems produce globally coherent behavior. Inspired by Michael Levin's work on collective intelligence and biological agency, built to make the transition from local chaos to collective order visible and interactive.

creative codingsimulationcomplexity