The Eval Index / Red Teaming & Safety / #241

usail-hkust/Jailjudge

by usail-hkust · Red Teaming & Safety · updated 1y ago

JAILJUDGE: A comprehensive evaluation benchmark which includes a wide range of risk scenarios with complex malicious prompts (e.g., synthetic, adversarial, in-the-wild, and multi-language scenarios, etc.) along with high-quality human- annotated test datasets.

momentum

stars

forks

#241

rank

View on GitHub →

usail-hkust/Jailjudge

More in Red Teaming & Safety