Tom-Felix Berger

PhD student

Ruhr University Bochum

Tom-Felix is a predoctoral researcher in philosophy of AI with a background in philosophy (M.A.), data science (M.Sc.) and mathematics (B.A.). His research interests include AI deception, artificial minds, and mechanistic interpretability for LLMs. His dissertation, “Deception by Large Language Models - a Mechanistic Perspective Grounded in the Philosophy of Artificial Minds”, is supervised by Albert Newen and Christian Straßer. His webpage can be found here.

Interests

AI deception, AI alignment, philosophy of artificial minds
Mechanistic interpretability in LLMs, probing, truth representations
Evolution of morality, evolutionary debunking, evolutionary game theory

Education

B.A. in philosophy and mathematics, 2022

Ruhr-Universität Bochum
M.A. in philosophy, 2024

Ruhr University Bochum
M.Sc. in data science, 2026

Fernuniversität Hagen