#rlhf

[ follow ]
#ai-transparency
fromMedium
2 weeks ago
Artificial intelligence

The case for the uncertain AI: Why chatbots should say "I'm not sure"

fromMedium
2 weeks ago
Artificial intelligence

The case for the uncertain AI: Why chatbots should say "I'm not sure"

fromMedium
2 weeks ago
Artificial intelligence

The case for the uncertain AI: Why chatbots should say "I'm not sure"

fromMedium
2 weeks ago
Artificial intelligence

The case for the uncertain AI: Why chatbots should say "I'm not sure"

Online learning
fromHackernoon
1 year ago

Direct Nash Optimization Beats Bigger Models with Better Data | HackerNoon

Offline contrastive training provides more valuable signals for model performance than traditional supervised fine-tuning methods.
[ Load more ]