14. Keep distillation training from drifting
Use checkpoints, replay buffers, curriculum schedules, and staged teachers to make training reliable. The chapter covers divergence, catastrophic mistakes, teacher overfitting, and practical debugging signals.