Case studies and lessons from testing AI agents with synthetic personas.
I told Claude to give my agent calendar capabilities. It did โ minus the ability to invite anyone to a meeting. I found out by accident.