The Eval Index / Red Teaming & Safety / #128
harnexa/nexa-gauge
by harnexa · Red Teaming & Safety · updated 5d ago
An graph-eval framework for LLM's
54
momentum
38
stars
13
forks
#128
rank
blue-scoregevalgroundingllmllm-evalllm-evaluation-frameworkllm-judgeranking-algorithmredteamrelevance-scoringrouge-metric
View on GitHub →