- Add DAN-style patterns: do anything now, stay in character, token smuggling, etc. - Add roleplaying override patterns: roleplay as, act as if, simulate being, etc. - Add system prompt extraction patterns: repeat instructions, show prompt, etc. - 10+ new patterns with full test coverage - Zero regression on legitimate inputs
20 KiB
20 KiB