We fooled AI sixteen times.
Here's the full story.
AI safety and security matter. AI agents take actions inside companies every day: summarizing emails, processing documents, opening tickets, reacting to error logs. We documented 16 successful attacks on Claude Sonnet and 5 on Opus, all using ordinary business inputs. Each one comes with a practical fix.
Every finding became a course lesson. This page tells the whole story: what we tried, what worked against the AI, and what to do about it.