What Is Mechanistic Interpretability? Neel Nanda Explains

Media Summary: Art by ... Clipped from episode 19 of AXRP. Transcript of that episode: ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to ... This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp: as of Oct 2025, what had changed?

What Is Mechanistic Interpretability? Neel Nanda Explains - Detailed Analysis & Overview

This is a talk I gave to my MATS scholars, with a stylised history of the field of ... Part 1 of a walkthrough of our paper, Progress Measures for Grokking via ...

See part 2 here: Implementing GPT-2 from Scratch. Template notebook: ... A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ... Neural networks have become increasingly impressive in recent years, but there's a big catch: we don't really know what they are ... A talk I gave to my MATS 9.0 training program about reasoning model ...