Media Summary: Art by Clipped from episode 19 of AXRP: Transcript of that episode: ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

What Is Mechanistic Interpretability Neel Nanda Explains - Detailed Analysis & Overview

Art by Clipped from episode 19 of AXRP: Transcript of that episode: ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? This is a talk I gave to my MATS scholars, with a stylised history of the field of Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ... Part 1 of a walkthrough of our paper, Progress Measures for Grokking via

See part 2 here: Implementing GPT-2 from Scratch Template notebook: ... Check out Gradient now and redeem your free 5$ credits! Solving AI Doomerism: ... A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ... Neural networks have become increasingly impressive in recent years, but there's a big catch: we don't really know what they are ... A talk I gave to my MATS 9.0 training program about reasoning model

Photo Gallery

What is mechanistic interpretability? Neel Nanda explains.
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
What Matters Right Now In Mechanistic Interpretability?
The Story of Mech Interp
Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour
The Dark Matter of AI [Mechanistic Interpretability]
Mechanistic Interpretability - NEEL NANDA (DeepMind)
A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)
Part 2: 5. Interpretability
What is a Transformer? (Transformer Walkthrough Part 1/2)
Reading AI's Mind - Mechanistic Interpretability Explained [Anthropic Research]
What is interpretability?
View Detailed Profile
What is mechanistic interpretability? Neel Nanda explains.

What is mechanistic interpretability? Neel Nanda explains.

Art by @hamishdoodles Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64 Transcript of that episode: ...

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

The Story of Mech Interp

The Story of Mech Interp

This is a talk I gave to my MATS scholars, with a stylised history of the field of

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda

The Dark Matter of AI [Mechanistic Interpretability]

The Dark Matter of AI [Mechanistic Interpretability]

...

Mechanistic Interpretability - NEEL NANDA (DeepMind)

Mechanistic Interpretability - NEEL NANDA (DeepMind)

http://80000hours.org/mlst Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of our paper, Progress Measures for Grokking via

Part 2: 5. Interpretability

Part 2: 5. Interpretability

Neel Nanda

What is a Transformer? (Transformer Walkthrough Part 1/2)

What is a Transformer? (Transformer Walkthrough Part 1/2)

See part 2 here: Implementing GPT-2 from Scratch https://neelnanda.io/transformer-tutorial-2 Template notebook: ...

Reading AI's Mind - Mechanistic Interpretability Explained [Anthropic Research]

Reading AI's Mind - Mechanistic Interpretability Explained [Anthropic Research]

Check out Gradient now and redeem your free 5$ credits! https://gradient.1stcollab.com/bycloud Solving AI Doomerism: ...

What is interpretability?

What is interpretability?

A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

What Do Neural Networks Really Learn? Exploring the Brain of an AI Model

What Do Neural Networks Really Learn? Exploring the Brain of an AI Model

Neural networks have become increasingly impressive in recent years, but there's a big catch: we don't really know what they are ...

How Reasoning Models Break Mechanistic Interpretability Techniques

How Reasoning Models Break Mechanistic Interpretability Techniques

A talk I gave to my MATS 9.0 training program about reasoning model

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

In this talk,

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)

PART 1* — a comprehensive update on