Media Summary: When Anthropic tested Claude Sonnet 4.5 for ... This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? ... A talk I gave to my MATS 9.0 training program about reasoning model ...
Neel Nanda Our Pivot To Pragmatic Interpretability Alignment Workshop - Detailed Analysis & Overview
How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ... How good are we at understanding the internal computation of advanced machine learning models, and do we have a hope at ... A talk I gave to my MATS 9.0 Training Program on using ...
This is a talk I gave to my MATS scholars, with a stylised history of the field of mechanistic ... Atticus Geiger and I go through his paper, Finding ...