Tom Henighan

I work on large language model interpretability at Anthropic. Prior to that I worked on scaling laws at OpenAI and ML engineering at at Beehive AI. I did my PhD in Physics at Stanford.

Github / Scholar