tool-eval is a command-line framework for evaluating the tool usage of AI agents. It provides researchers and developers with tools to analyze and benchmark agent interactions with external tools.
Latest indexed changes and source events
Other apps tracked under the same category.