Glossary term
Jailbreak
A crafted prompt that coaxes a model past its safety constraints, making it produce content or take actions its policy forbids.
When it matters
When a public-facing model can be talked out of its limits, attackers may extract restricted output or trigger disallowed behaviour.