The Eval Index / Agent Eval / #151

NoesisVision/nasde-toolkit

by NoesisVision · Agent Eval · updated 2d ago

CLI for benchmarks & evals of AI coding agents — on tasks you already understand, using your Claude / Codex / Gemini individual subscriptions or API keys.

44
momentum
10
stars
0
forks
#151
rank
agent-benchmarkagent-evaluationai-coding-agentsai-evaluationclaude-codeclaude-skillscli-toolcodexevalsgemini-cliharborllm-as-a-judge
View on GitHub →