Artificial intelligence
fromInfoWorld
4 days agoResearchers propose a self-distillation fix for 'catastrophic forgetting' in LLMs
Continual learning enables foundation models to keep improving over time, and SDFT uses in-context demonstrations to generate on-policy signals without explicit rewards.