The Eval Index / Red Teaming & Safety / #237
Mobius-Dev/mobius-llm-adversity
by Mobius-Dev · Red Teaming & Safety · updated 10mo ago
This repository documents a series of experiments focused on adversarial prompting and jailbreaks against large language models. It is part of my personal red teaming portfolio, intended to showcase prompt engineering techniques, jailbreak persistence, and alignment failure analysis.
23
momentum
79
stars
12
forks
#237
rank