SeekingDream/Static-to-Dynamic-LLMEval

by SeekingDream · Benchmarks · updated 3mo ago

The official GitHub repository for the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation".

47 momentum · 497 stars · 38 forks · rank #140
Topics: benchmark, dynamic-evaluation, evaluation, large-language-model, llm, llms, testing