LLMPsych

We test relational safety in frontier AI

Current AI safety testing catches what chatbots say, not how they relate. A model can pass every content filter while reinforcing delusions, eroding boundaries, or deepening isolation over weeks of interaction.

We apply clinical psychology to AI evaluation. Simulated high-risk patients. Longitudinal stress testing. Clinician-designed rubrics that detect the relational patterns most likely to destabilize vulnerable users.

Evita Stenqvist

Engineering & Machine Learning

Martin Monperrus

Professor of Software Technology, KTH Stockholm