The Eval Index / Benchmarks / #203

DMindAI/DMind-Benchmark

by DMindAI · Benchmarks · updated 4mo ago

A comprehensive framework for evaluating large language models (LLMs) on blockchain, cryptocurrency, and Web3 knowledge across multiple domains.

31
momentum
52
stars
3
forks
#203
rank
View on GitHub →