Interpretability Hackathon 0 0 Keynote with Neel Nanda

Media Summary:
How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ...
This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp: as of Oct 2025, what had changed?
This is a talk I gave to my MATS scholars, with a stylised history of the field of mechanistic ...


A talk I gave to my MATS 9.0 Training Program on using ...
When Anthropic tested Claude Sonnet 4.5 for alignment, the model appeared perfectly behaved, but it turned out the model had ...
Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic ...

This talk is a whirlwind overview of several key areas of open problems in mechanistic ...
Part 1: a comprehensive update on mechanistic ...