ai-twinkle/Eval

by ai-twinkle · Reasoning · updated 2mo ago

High-performance LLM evaluation framework with parallel API calls — up to 17× faster than sequential tools. Supports box, math, and logit-based evaluation.

momentum