Media Summary: This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? ... This is a talk I gave to my MATS scholars, with a stylised history of the field of ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to ...

Mechanistic Interpretability 1.0 Hackathon (Neel Nanda) - Detailed Analysis & Overview

Mechanistic Interpretability 1.0 Hackathon - Neel Nanda

The keynote for the

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

The Story of Mech Interp

This is a talk I gave to my MATS scholars, with a stylised history of the field of

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Interpretability Hackathon 0.0 Keynote w/ Neel Nanda

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to

Open Problems in Mechanistic Interpretability: A Whirlwind Tour | Neel Nanda | EAGxVirtual 2023

Neel Nanda: Mechanistic Interpretability & Mathematics

A Whirlwind Tour of Mechanistic Interpretability - Neel Nanda

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

When Anthropic tested Claude Sonnet 4.5 for alignment, the model appeared perfectly behaved — but it turned out the model had ...

Mechanistic Interpretability - NEEL NANDA (DeepMind)

http://80000hours.org/mlst Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...

Interpretability Hackathon 3.0 Keynote - Neel Nanda

What is mechanistic interpretability? Neel Nanda explains.

Art by @hamishdoodles. Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64. Transcript of that episode: ...

How Reasoning Models Break Mechanistic Interpretability Techniques

A talk I gave to my MATS 9.0 training program about reasoning model

What Happened With Sparse Autoencoders?

Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ...

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)
