Jailbreak prompt filtering system for LLMs.
Classify prompts using selected models.
Attention Tracker for Prompt Injection Detection