LLMs Aligned! But to What End?
Briefly

Reinforcement Learning offers us a chance to supplement traditional fine-tuning methods of prompt-response pairs with a system designed to 'nudge' the AI in a direction - funnier, more neutral, more diverse, etc.
Reinforcement Learning from Feedback (RLF) involves giving an AI iterative feedback on solving a task, letting the LLM adapt its performance over time, enhancing the AI's expected behavior.
Read at Medium
[
add
]
[
|
|
]