stalkermustang/llm-bulls-and-cows-benchmark

by stalkermustang · Reasoning · updated 1y ago

A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.

momentum