CooperBench is an open-source benchmarking suite designed to evaluate the performance and coordination of AI agent teams on collaborative coding tasks. It provides a standardized set of over 600 tasks, leaderboards, and analysis tools to help researchers and developers assess how well agents work together and with humans. The benchmark highlights challenges in agent coordination and supports the development of more effective multi-agent systems.
Latest indexed changes and source events
cooperbench.com discovered by the PulseGate indexer
Other apps tracked under the same category.