symflower/eval-dev-quality
by symflower · Coding Eval · updated 1y ago
DevQualityEval: An evaluation benchmark and framework to compare and evolve the quality of code generation of LLMs.
Momentum: 28
Stars: 186
Forks: 10
Rank: #213
Topics: evaluation, evaluation-framework, llms, software-development, software-quality