onejune2018/Awesome-LLM-Eval
by onejune2018 · RAG Eval · updated 6mo ago
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of foundation LLMs. It aims to explore the technical boundaries of generative AI.
35 momentum · 642 stars · 74 forks · rank #188
awsome-list · awsome-lists · benchmark · bert · chatglm · chatgpt · dataset · evaluation · gpt3 · large-language-model · leaderboard · llama