open-compass/opencompass

by open-compass · Benchmarks · updated today

OpenCompass is an LLM evaluation platform that supports a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) on 100+ datasets.

79 momentum · 7,081 stars · 788 forks · rank #16
benchmark · chatgpt · evaluation · large-language-model · llama2 · llama3 · llm · openai