GitHub - Addepto/contextcheck: Human-friendly framework to test and evaluate LLMs, RAGs and ChatBots.
Briefly

ContextCheck is a human-friendly open-source framework that evaluates and tests large language models, RAG systems, and chatbots, ensuring their reliability and robustness.
The framework supports flexible configuration using human-readable YAML files, allowing easy integration into CI pipelines for continuous automated testing and validation.
With ContextCheck, you can define simple test scenarios, validate responses through various methods, and customize output formats for clear displays of test results.
Users can easily install ContextCheck via PyPI, set up a development environment with Poetry, and contribute to the project directly on GitHub.
Read at GitHub
[
|
]