
harnexa/nexa-gauge

by harnexa · Red Teaming & Safety · updated 5d ago

A graph-eval framework for LLMs

54 momentum · 38 stars · 13 forks · rank #128
blue-score · geval · grounding · llm · llm-eval · llm-evaluation-framework · llm-judge · ranking-algorithm · redteam · relevance-scoring · rouge-metric