AI Safety

All articles tagged with "AI Safety"

1Total Articles

Prompt to Define What an Agent Must Never Do

An agent without explicit hard limits will interpret ambiguous instructions as permission. This prompt pattern defines absolute constraints that persist even when users push against them or provide creative justifications.