ProgramBench is an open-source benchmark that evaluates how well language models and agents can rebuild software from compiled binaries and documentation. It provides tasks, leaderboards, and evaluation tools for researchers and developers working on code generation and reverse engineering.
Latest indexed changes and source events
programbench.com discovered by the PulseGate indexer