All Benchmark Runs

Date                 eforge Version  Dataset                       Instances  Resolved  Rate   Details
2026-03-28T03-05-38  unknown         princeton-nlp/SWE-bench_Lite  5          2         40.0%  View
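
The Rate column appears to be Resolved divided by Instances, shown as a percentage with one decimal place; the single row listed is consistent with that reading (2 of 5 gives 40.0%). A minimal sketch of that arithmetic follows. The function name `resolve_rate` is hypothetical and is not taken from eforge or SWE-bench.

```python
def resolve_rate(resolved: int, instances: int) -> str:
    """Return resolved/instances as a percentage string with one decimal.

    Assumption: the table's Rate column is computed this way. This helper
    is illustrative only and is not part of any eforge or SWE-bench API.
    """
    if instances <= 0:
        raise ValueError("instances must be positive")
    if not 0 <= resolved <= instances:
        raise ValueError("resolved must be between 0 and instances")
    # Multiply before dividing to avoid a floating-point artifact in the result.
    return f"{100 * resolved / instances:.1f}%"


# Values from the run listed above: 5 instances, 2 resolved.
print(resolve_rate(2, 5))  # 40.0%
```

With only 5 instances, each resolved instance shifts the rate by 20 percentage points, so a single run of this size is a coarse measurement.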