
projects

[active] Simplicity-Bias Research Project: What does **'simplicity'** mean for neural networks? The project distinguishes two notions: how complex a property is to represent in an input tensor (**a priori**), and how complex an internal mechanism must be to detect and exploit it for reward (**a posteriori**). It aims to formalize the first with **descriptive complexity theory** and the second with **singular learning theory**, and to study how the two interact, with **simplicity bias** and **goal misgeneralization** as the motivating phenomena.

[active] Arguments Map: Building a comprehensive map of the **arguments around AI safety and AI risk**: their assumptions, their sources, and how claims relate to one another. The aim is to make it easy for **anyone** to **locate where they stand** and see what follows from that position. Currently in Obsidian; will become its own site. This is one of the main activities I decided to pursue at **AFFINE 2026**.

[active] Field-Building: While I build the theoretical and practical background to contribute to AI safety myself, the way I can make the most impact is by **helping shape the landscape** for those whose work can already impact the field. Founding team member of **Safe AI Netherlands (SAIN)**. Active in **AI Safety Amsterdam (AISA)**. Working to **seed an AI safety community in Milan**.