The Eval Index / Benchmarks / #140
SeekingDream/Static-to-Dynamic-LLMEval
by SeekingDream · Benchmarks · updated 3mo ago
The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"
47
momentum
497
stars
38
forks
#140
rank
benchmarkdynamic-evaluationevaluationlarge-language-modelllmllmstesting
View on GitHub →