#sdft
#sdft

[ follow ]

Researchers propose a self-distillation fix for 'catastrophic forgetting' in LLMs

Continual learning enables foundation models to keep improving over time, and SDFT uses in-context demonstrations to generate on-policy signals without explicit rewards.

[ Load more ]

#sdft#sdft

Researchers propose a self-distillation fix for 'catastrophic forgetting' in LLMs

#sdft
#sdft