Tom-Felix Berger

Tom-Felix Berger

PhD student

Ruhr University Bochum

Tom-Felix is a predoctoral researcher in philosophy of AI with a background in philosophy (M.A.), data science (M.Sc.) and mathematics (B.A.). His research interests include AI deception, artificial minds, and mechanistic interpretability for LLMs. His dissertation “Deception by Largue Language Models - a Mechanistic Perspective Grounded in the Philosophy of Artificial Minds” is supervized by Albert Newen and Christian Straßer. You find his webpage here.

Interests
  • AI deception, AI alignment, philosophy of artificial minds
  • Mechanistic interpretability in LLMs, Probing, Truth Representations
  • Evolution of Morality, Evolutionary Debunking, Evolutionary Game Theory
Education
  • B.A. in philosophy and mathematics, 2022

    Ruhr-Universität Bochum

  • M.A. in philosophy, 2024

    Ruhr-University Bochum

  • M.Sc. in data science, 2026

    Fernuniversität Hagen