← Leaderboard

The Eval Index / Benchmarks / #248

h2oai/h2o-LLM-eval

by h2oai · Benchmarks · updated 1y ago

Large-language Model Evaluation framework with Elo Leaderboard and A-B testing

21

momentum

52

stars

1

forks

#248

rank

View on GitHub →