h2oai/h2o-LLM-eval
by h2oai · Benchmarks · updated 1y ago
Large language model evaluation framework with an Elo leaderboard and A/B testing
momentum 21 · stars 52 · forks 1 · rank #248