The Eval Index / Eval Frameworks / #46

stanford-crfm/helm

by stanford-crfm · Eval Frameworks · updated 7d ago

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.

momentum

2,822

stars

398

forks

#46

rank

View on GitHub →

stanford-crfm/helm

More in Eval Frameworks