Media Summary: Hossein Mobahi, Google Research In supervised Authors: Zhenzhu Zheng (University of Delaware)*; Xi Peng (University of Delaware) Description: We present In this AI Research Roundup episode, Alex discusses the paper: 'A Predictive Law for On-Policy

Improving Generalization By Self Training Self Distillation - Detailed Analysis & Overview

Hossein Mobahi, Google Research In supervised Authors: Zhenzhu Zheng (University of Delaware)*; Xi Peng (University of Delaware) Description: We present In this AI Research Roundup episode, Alex discusses the paper: 'A Predictive Law for On-Policy This week we review the paper Reinforcement In this video, we sit down with Jonas Hübotter (ETH Zurich) and Idan Shenfeld (MIT) to break down Portal is the home of the AI for drug discovery community. Join for more details on this talk and to connect with the speakers: ...

In this AI Research Roundup episode, Alex discusses the paper: 'Anti- In this AI Research Roundup episode, Alex discusses the paper: 'Strong Teacher Not Needed? On Authors: Chen, Wei-Chi; Chu, Wei-Ta* Description: With labeled data, Authors: Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin Description: Deep neural networks with millions of parameters may ... Abstract: Smaller language models can develop robust reasoning capabilities through pre-

Photo Gallery

Improving Generalization by Self-Training & Self Distillation
Self-Guidance: Improve Deep Neural Network Generalization via Knowledge Distillation
Predict LLM Self-Distillation Before Training
Self-Distillation Enables Continual Learning
Self-Distillation Enables Continual  Learning
Reinforcement Learning via Self-Distillation
Knowledge Distillation: How LLMs train each other
Self-Distillation Enables Continual Learning - Idan Shenfeld
Embarrassingly Simple Self-Distillation Improves Code Generation
Self Regulated Learning Mechanism for Data Efficient Knowledge Distillation | IJCNN 2021
Why Self-Distillation Is Taking Over LLM Post-Training (w/ the Researchers Behind It)
Self-Distillation Enables Continual Learning Paper-2026
View Detailed Profile
Improving Generalization by Self-Training & Self Distillation

Improving Generalization by Self-Training & Self Distillation

Hossein Mobahi, Google Research In supervised

Self-Guidance: Improve Deep Neural Network Generalization via Knowledge Distillation

Self-Guidance: Improve Deep Neural Network Generalization via Knowledge Distillation

Authors: Zhenzhu Zheng (University of Delaware)*; Xi Peng (University of Delaware) Description: We present

Predict LLM Self-Distillation Before Training

Predict LLM Self-Distillation Before Training

In this AI Research Roundup episode, Alex discusses the paper: 'A Predictive Law for On-Policy

Self-Distillation Enables Continual Learning

Self-Distillation Enables Continual Learning

Paper:

Self-Distillation Enables Continual  Learning

Self-Distillation Enables Continual Learning

Unlocking the Future of AI:

Reinforcement Learning via Self-Distillation

Reinforcement Learning via Self-Distillation

This week we review the paper Reinforcement

Knowledge Distillation: How LLMs train each other

Knowledge Distillation: How LLMs train each other

In this video, we break down knowledge

Self-Distillation Enables Continual Learning - Idan Shenfeld

Self-Distillation Enables Continual Learning - Idan Shenfeld

... we see that we

Embarrassingly Simple Self-Distillation Improves Code Generation

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper: Embarrassingly Simple

Self Regulated Learning Mechanism for Data Efficient Knowledge Distillation | IJCNN 2021

Self Regulated Learning Mechanism for Data Efficient Knowledge Distillation | IJCNN 2021

Self

Why Self-Distillation Is Taking Over LLM Post-Training (w/ the Researchers Behind It)

Why Self-Distillation Is Taking Over LLM Post-Training (w/ the Researchers Behind It)

In this video, we sit down with Jonas Hübotter (ETH Zurich) and Idan Shenfeld (MIT) to break down

Self-Distillation Enables Continual Learning Paper-2026

Self-Distillation Enables Continual Learning Paper-2026

Continual

How to build a consistency model: Learning flow maps via self-distillation | Nicholas Boffi

How to build a consistency model: Learning flow maps via self-distillation | Nicholas Boffi

Portal is the home of the AI for drug discovery community. Join for more details on this talk and to connect with the speakers: ...

Self-Distillation Enables Continual Learning (Jan 2026)

Self-Distillation Enables Continual Learning (Jan 2026)

Title:

Anti-Self-Distillation for LLM Reasoning

Anti-Self-Distillation for LLM Reasoning

In this AI Research Roundup episode, Alex discusses the paper: 'Anti-

LLM Distillation: Strong Teachers Not Needed

LLM Distillation: Strong Teachers Not Needed

In this AI Research Roundup episode, Alex discusses the paper: 'Strong Teacher Not Needed? On

SSSD: Self-Supervised Self Distillation

SSSD: Self-Supervised Self Distillation

Authors: Chen, Wei-Chi; Chu, Wei-Ta* Description: With labeled data,

Regularizing Class-Wise Predictions via Self-Knowledge Distillation

Regularizing Class-Wise Predictions via Self-Knowledge Distillation

Authors: Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin Description: Deep neural networks with millions of parameters may ...

Enhancing Reasoning in Smaller Models through Self-Training

Enhancing Reasoning in Smaller Models through Self-Training

Abstract: Smaller language models can develop robust reasoning capabilities through pre-

Self-Distillation as a New Framework for Continual Learning | Idan Shenfeld | Random Samples

Self-Distillation as a New Framework for Continual Learning | Idan Shenfeld | Random Samples

Self