Media Summary: When Anthropic tested Claude Sonnet 4.5 for This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? A talk I gave to my MATS 9.0 training program about reasoning model

Neel Nanda Our Pivot To Pragmatic Interpretability Alignment Workshop - Detailed Analysis & Overview

When Anthropic tested Claude Sonnet 4.5 for This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? A talk I gave to my MATS 9.0 training program about reasoning model How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ... How good are we at understanding the internal computation of advanced machine learning models, and do we have a hope at ... A talk I gave to my MATS 9.0 Training Program on using

This is a talk I gave to my MATS scholars, with a stylised history of the field of mechanistic Atticus Geiger and I go through his paper, Finding

Photo Gallery

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]
Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]
What Matters Right Now In Mechanistic Interpretability?
Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour
How Reasoning Models Break Mechanistic Interpretability Techniques
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)
Mechanistic Interpretability - NEEL NANDA (DeepMind)
19 - Mechanistic Interpretability with Neel Nanda
Part 2: 5. Interpretability
Neel Nanda–Mechanistic Interpretability, Superposition, Grokking
Can Interpretability Control Model Training?
View Detailed Profile
Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

When Anthropic tested Claude Sonnet 4.5 for

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda

How Reasoning Models Break Mechanistic Interpretability Techniques

How Reasoning Models Break Mechanistic Interpretability Techniques

A talk I gave to my MATS 9.0 training program about reasoning model

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ...

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of

Mechanistic Interpretability - NEEL NANDA (DeepMind)

Mechanistic Interpretability - NEEL NANDA (DeepMind)

http://80000hours.org/mlst Visit

19 - Mechanistic Interpretability with Neel Nanda

19 - Mechanistic Interpretability with Neel Nanda

How good are we at understanding the internal computation of advanced machine learning models, and do we have a hope at ...

Part 2: 5. Interpretability

Part 2: 5. Interpretability

Neel Nanda

Neel Nanda–Mechanistic Interpretability, Superposition, Grokking

Neel Nanda–Mechanistic Interpretability, Superposition, Grokking

Neel Nanda

Can Interpretability Control Model Training?

Can Interpretability Control Model Training?

A talk I gave to my MATS 9.0 Training Program on using

Neel Nanda: Mechanistic Intepretability (HAAISS 2024)

Neel Nanda: Mechanistic Intepretability (HAAISS 2024)

Neel Nanda

The Story of Mech Interp

The Story of Mech Interp

This is a talk I gave to my MATS scholars, with a stylised history of the field of mechanistic

Open Problems in Mechanistic Interpretability: A Whirlwind Tour | Neel Nanda | EAGxVirtual 2023

Open Problems in Mechanistic Interpretability: A Whirlwind Tour | Neel Nanda | EAGxVirtual 2023

Mechanistic

Interpretability Hackathon 3.0 Keynote - Neel Nanda

Interpretability Hackathon 3.0 Keynote - Neel Nanda

Neel Nanda

A Walkthrough of Aligning Causal Variables and Distributed Representations w/ Atticus Geiger (1/3)

A Walkthrough of Aligning Causal Variables and Distributed Representations w/ Atticus Geiger (1/3)

Atticus Geiger and I go through his paper, Finding

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: Why? (Part 3/3)

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: Why? (Part 3/3)

Part 3 of a walkthrough of