The Eval Index / Red Teaming & Safety / #237

Mobius-Dev/mobius-llm-adversity

by Mobius-Dev · Red Teaming & Safety · updated 10mo ago

This repository documents a series of experiments focused on adversarial prompting and jailbreaks against large language models. It is part of my personal red teaming portfolio, intended to showcase prompt engineering techniques, jailbreak persistence, and alignment failure analysis.

momentum

stars

forks

#237

rank

View on GitHub →

Mobius-Dev/mobius-llm-adversity

More in Red Teaming & Safety