Artificial intelligence
fromInfoQ
1 day agoOpenAI at QCon AI NYC: Fine Tuning the Enterprise
Agent RFT applies reinforcement fine-tuning to tool-using agents to improve multi-step, tool-mediated decision-making via graded rewards and trajectory-level credit assignment.