Media Summary: What if AI could learn from its mistakes the same way humans do? Portal is the home of the AI for drug discovery community. Join for more details on this talk and to connect with the speakers: ... ... 12 Feb 2026 Ted Kyi presents a deep dive into "

Reinforcement Learning Via Self Distillation - Detailed Analysis & Overview

What if AI could learn from its mistakes the same way humans do? Portal is the home of the AI for drug discovery community. Join for more details on this talk and to connect with the speakers: ... ... 12 Feb 2026 Ted Kyi presents a deep dive into " Can AI learn more from a "Why" than a "No"? Explore how This episode provides a technical summary and analysis of the research paper " In this AI Research Roundup episode, Alex discusses the paper: '

Recorded live at the Agent Engineering Session Day from the AI Engineer Summit 2025 in New York. Learn more at ... In this video, we will learn about two great RL methods for

Photo Gallery

Reinforcement Learning via Self-Distillation
ETH Zurich/Max Planck Institute/MIT/Stanford: Reinforcement Learning via Self-Distillation
[Paper of the Day] SDPO: Reinforcement Learning via Self-Distillation
How to build a consistency model: Learning flow maps via self-distillation | Nicholas Boffi
Reinforcement Learning via Self-Distillation (Jan 2026)
RL via Self-Distillation (SDPO) Paper Club 12 Feb 2026
Reinforcement Learning via Self-Distillation: Solving the Credit Assignment Problem
2601.20802 - Reinforcement Learning via Self-Distillation
Reinforcement Learning from scratch
SDPO: Reinforcement Learning via Self-Distillation (Hübotter et al.)
Self-Distillation Enables Continual Learning - Idan Shenfeld
How AI Learns to Critique Its Own Failures
View Detailed Profile
Reinforcement Learning via Self-Distillation

Reinforcement Learning via Self-Distillation

This week we review the paper

ETH Zurich/Max Planck Institute/MIT/Stanford: Reinforcement Learning via Self-Distillation

ETH Zurich/Max Planck Institute/MIT/Stanford: Reinforcement Learning via Self-Distillation

Unlocking

[Paper of the Day] SDPO: Reinforcement Learning via Self-Distillation

[Paper of the Day] SDPO: Reinforcement Learning via Self-Distillation

What if AI could learn from its mistakes the same way humans do?

How to build a consistency model: Learning flow maps via self-distillation | Nicholas Boffi

How to build a consistency model: Learning flow maps via self-distillation | Nicholas Boffi

Portal is the home of the AI for drug discovery community. Join for more details on this talk and to connect with the speakers: ...

Reinforcement Learning via Self-Distillation (Jan 2026)

Reinforcement Learning via Self-Distillation (Jan 2026)

Title:

RL via Self-Distillation (SDPO) Paper Club 12 Feb 2026

RL via Self-Distillation (SDPO) Paper Club 12 Feb 2026

... 12 Feb 2026 Ted Kyi presents a deep dive into "

Reinforcement Learning via Self-Distillation: Solving the Credit Assignment Problem

Reinforcement Learning via Self-Distillation: Solving the Credit Assignment Problem

Can AI learn more from a "Why" than a "No"? Explore how

2601.20802 - Reinforcement Learning via Self-Distillation

2601.20802 - Reinforcement Learning via Self-Distillation

title:

Reinforcement Learning from scratch

Reinforcement Learning from scratch

How does

SDPO: Reinforcement Learning via Self-Distillation (Hübotter et al.)

SDPO: Reinforcement Learning via Self-Distillation (Hübotter et al.)

This video provides an overview of

Self-Distillation Enables Continual Learning - Idan Shenfeld

Self-Distillation Enables Continual Learning - Idan Shenfeld

...

How AI Learns to Critique Its Own Failures

How AI Learns to Critique Its Own Failures

This episode provides a technical summary and analysis of the research paper "

OPSD: Faster LLM Reasoning via Self-Distillation

OPSD: Faster LLM Reasoning via Self-Distillation

In this AI Research Roundup episode, Alex discusses the paper: '

Self-Distillation Enables Continual  Learning

Self-Distillation Enables Continual Learning

Unlocking the Future of AI:

Agent Learns to do Reinforcement Learning

Agent Learns to do Reinforcement Learning

"In-context

SPD: Boosting LLMs via Self-Distillation

SPD: Boosting LLMs via Self-Distillation

In this AI Research Roundup episode, Alex discusses the paper: '

Self-Distillation Enables Continual Learning

Self-Distillation Enables Continual Learning

Paper:

SDAR: Gated Self-Distillation for Stable Agentic Reinforcement Learning

SDAR: Gated Self-Distillation for Stable Agentic Reinforcement Learning

Introducing SDAR, a new

Reinforcement Learning for Agents - Will Brown, ML Researcher at Morgan Stanley

Reinforcement Learning for Agents - Will Brown, ML Researcher at Morgan Stanley

Recorded live at the Agent Engineering Session Day from the AI Engineer Summit 2025 in New York. Learn more at ...

How to solve Reinforcement Learning when there are ZERO rewards (Curiosity & RND)

How to solve Reinforcement Learning when there are ZERO rewards (Curiosity & RND)

In this video, we will learn about two great RL methods for