huggingface/evaluation-guidebook

by huggingface · Benchmarks · updated 6mo ago

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

momentum

2,117

stars

124

forks

#158

rank

evaluationevaluation-metricsguidebooklarge-language-modelsllmmachine-learningtutorial

View on GitHub →

huggingface/evaluation-guidebook

More in Benchmarks