rubric-eval is an open-source CLI tool for testing and evaluating agent behavior in LLM applications. It automatically captures agent runs, integrates with evaluation tools, and helps catch regressions in CI pipelines. It is aimed at developers building AI agents and other LLM-based applications.