h2oai/h2o-LLM-eval

by h2oai · Benchmarks · updated 1y ago

A large language model evaluation framework with an Elo leaderboard and A/B testing.

21 momentum · 52 stars · 1 fork · rank #248