#llm-evaluators

Artificial intelligence
from InfoQ
1 day ago

Google Stax Aims to Make AI Model Evaluation Accessible for Developers

Google Stax provides an objective, data-driven, repeatable framework for AI model evaluation with customizable datasets, default and custom evaluators, and LLM-based judges.