litebench is an open-source CLI tool that enables developers and researchers to benchmark large language models and AI agents. It supports quick setup and evaluation workflows, including popular benchmarks like GSM8K and HumanEval.
Latest indexed changes and source events
Other apps tracked under the same category.