The Eval Index / Benchmarks / #16
open-compass/opencompass
by open-compass · Benchmarks · updated today
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
79
momentum
7,081
stars
788
forks
#16
rank
benchmarkchatgptevaluationlarge-language-modelllama2llama3llmopenai
View on GitHub →