The Eval Index / Benchmarks / #263

khoj-ai/llm-coup

by khoj-ai · Benchmarks · updated 9mo ago

Let LLMs play coup with each other and see who's the best at deception & strategy

momentum

stars

forks

#263

rank

aiartificial-intelligencecoupdeceptionenvironmentgamesllm-benchmarkingllm-evalllm-evaluationllmssimulation

More in Benchmarks