Media Summary: How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ... This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? This is a talk I gave to my MATS scholars, with a stylised history of the field of mechanistic

Interpretability Hackathon 0 0 Keynote W Neel Nanda - Detailed Analysis & Overview

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ... This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? This is a talk I gave to my MATS scholars, with a stylised history of the field of mechanistic A talk I gave to my MATS 9.0 Training Program on using When Anthropic tested Claude Sonnet 4.5 for alignment, the model appeared perfectly behaved — but it turned out the model had ... Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic

Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ... This talk is a whirlwind overview of several key areas of open problems in mechanistic PART 1* — a comprehensive update on mechanistic

Photo Gallery

Interpretability Hackathon 0.0 Keynote w/ Neel Nanda
Interpretability Hackathon 3.0 Keynote - Neel Nanda
Interpretability Hackathon 2.0 Keynote - Neel Nanda
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
Mechanistic Interpretability 1.0 Hackathon - Neel Nanda
What Matters Right Now In Mechanistic Interpretability?
Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour
The Story of Mech Interp
Can Interpretability Control Model Training?
Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]
Open Problems in Mechanistic Interpretability: A Whirlwind Tour | Neel Nanda | EAGxVirtual 2023
Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]
View Detailed Profile
Interpretability Hackathon 0.0 Keynote w/ Neel Nanda

Interpretability Hackathon 0.0 Keynote w/ Neel Nanda

Neel Nanda

Interpretability Hackathon 3.0 Keynote - Neel Nanda

Interpretability Hackathon 3.0 Keynote - Neel Nanda

Neel Nanda

Interpretability Hackathon 2.0 Keynote - Neel Nanda

Interpretability Hackathon 2.0 Keynote - Neel Nanda

Neel Nanda

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ...

Mechanistic Interpretability 1.0 Hackathon - Neel Nanda

Mechanistic Interpretability 1.0 Hackathon - Neel Nanda

The

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda

The Story of Mech Interp

The Story of Mech Interp

This is a talk I gave to my MATS scholars, with a stylised history of the field of mechanistic

Can Interpretability Control Model Training?

Can Interpretability Control Model Training?

A talk I gave to my MATS 9.0 Training Program on using

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda

Open Problems in Mechanistic Interpretability: A Whirlwind Tour | Neel Nanda | EAGxVirtual 2023

Open Problems in Mechanistic Interpretability: A Whirlwind Tour | Neel Nanda | EAGxVirtual 2023

Mechanistic

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

When Anthropic tested Claude Sonnet 4.5 for alignment, the model appeared perfectly behaved — but it turned out the model had ...

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic

Mechanistic Interpretability - NEEL NANDA (DeepMind)

Mechanistic Interpretability - NEEL NANDA (DeepMind)

http://80000hours.org/mlst Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...

Neel Nanda–Mechanistic Interpretability, Superposition, Grokking

Neel Nanda–Mechanistic Interpretability, Superposition, Grokking

Neel Nanda

Concrete open problems in mechanistic interpretability | Neel Nanda | EAG London 23

Concrete open problems in mechanistic interpretability | Neel Nanda | EAG London 23

This talk is a whirlwind overview of several key areas of open problems in mechanistic

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)

PART 1* — a comprehensive update on mechanistic

A Whirlwind Tour of Mechanistic Interpretability - Neel Nanda

A Whirlwind Tour of Mechanistic Interpretability - Neel Nanda

Neel Nanda

Neel Nanda on Avoiding an AI Catastrophe with Mechanistic Interpretability

Neel Nanda on Avoiding an AI Catastrophe with Mechanistic Interpretability

Neel Nanda