RouteWorks/RouterArena

by RouteWorks · Agent Eval · updated today

RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.

56 momentum · 94 stars · 27 forks · rank #119
Tags: arena, llm, llm-router, llm-routing, multi-agent, multi-agent-systems, router-benchmark, router-evaluation, router-leaderboard, routing