#sdft

[ follow ]
Artificial intelligence
fromInfoWorld
4 days ago

Researchers propose a self-distillation fix for 'catastrophic forgetting' in LLMs

Continual learning enables foundation models to keep improving over time, and SDFT uses in-context demonstrations to generate on-policy signals without explicit rewards.
[ Load more ]