Developing high-quality, safety-relevant evaluations remains challenging, and the demand is outpacing the supply... introducing a new initiative to fund evaluations developed by third-party organizations that can effectively measure advanced capabilities in AI models.
Proposals should focus on three areas: AI Safety Level assessments; advanced capability and safety metrics; and infrastructure, tools, and methods for developing evaluations.