#

coding-agent-benchmark

Here are 3 public repositories matching this topic...

linny006 / agent-eval-harness

Live, open-source benchmark for comparing AI coding agents on real GitHub issues

Updated Jun 16, 2026
Python

nripankadas07 / patchgym

Turn any Git repository into a local SWE-bench-style coding-agent benchmark.

python git testing benchmark evaluation developer-tools flagship ai-agents local-first hidden-tests llm coding-agents agent-evaluation reproducible-ai swe-bench coding-agent-benchmark release-track local-benchmark

Updated May 28, 2026
Python

ttxs69 / awesome-coding-agent-eval

A curated list of benchmarks, harnesses, leaderboards, and tools for evaluating AI coding agents.

benchmark leaderboard evaluation awesome-list codex ai-agent llm aider claude-code coding-agent swe-bench agent-eval ai-coding-agent-benchmark coding-agent-benchmark

Updated Jun 8, 2026

Improve this page

Add a description, image, and links to the coding-agent-benchmark topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the coding-agent-benchmark topic, visit your repo's landing page and select "manage topics."