We fooled AI sixteen times.
Here's the full story.
AI Safety and Security is paramount. AI agents are taking actions inside organizations every day — summarizing emails, processing documents, opening tickets, reacting to error logs. We documented 16 successful attacks on Claude Sonnet and 5 on Opus, all using ordinary business inputs. Each one comes with a practical safeguard.
Every finding became a course lesson. This page tells the whole story — what we tried, what worked against the AI, and what to do about it.