The Eval Index / Red Teaming & Safety / #237

Mobius-Dev/mobius-llm-adversity

by Mobius-Dev · Red Teaming & Safety · updated 10mo ago

This repository documents a series of experiments focused on adversarial prompting and jailbreaks against large language models. It is part of my personal red teaming portfolio, intended to showcase prompt engineering techniques, jailbreak persistence, and alignment failure analysis.

23
momentum
79
stars
12
forks
#237
rank
View on GitHub →