I really enjoy creating agents, but sometimes they’re doing quite sensitive work, for example editing records in HubSpot. Before I set an agent live, I’d love a way to test how it behaves without touching real, important data.
Ideally this would look like a dry run: using dummy data, a sandbox-style integration, or a mode where I can see what the agent would change without those changes actually being applied.
That way I can sanity-check the behaviour, get comfortable with the output, and only then switch it into a live environment.
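To make the idea a bit more concrete, here's a rough Python sketch of the "show me what would change" mode. Everything in it is made up for illustration: `DryRunRecordStore`, its `dry_run` flag, and the dict of dummy contacts are hypothetical and not part of HubSpot's or any agent platform's actual API. It's just one way such a switch could behave.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class DryRunRecordStore:
    """Hypothetical wrapper; a plain dict stands in for the real CRM."""

    records: dict[str, dict[str, Any]]
    dry_run: bool = True
    planned_changes: list[dict[str, Any]] = field(default_factory=list)

    def update(self, record_id: str, changes: dict[str, Any]) -> None:
        current = self.records[record_id]
        # Log a before/after entry for every field whose value would change.
        for key, new_value in changes.items():
            old_value = current.get(key)
            if old_value != new_value:
                self.planned_changes.append(
                    {"record": record_id, "field": key, "old": old_value, "new": new_value}
                )
        # Only touch the data when dry-run mode is switched off.
        if not self.dry_run:
            current.update(changes)


# Dummy data standing in for real records (assumption for this sketch).
contacts = {"c1": {"email": "old@example.com", "stage": "lead"}}

store = DryRunRecordStore(contacts, dry_run=True)
store.update("c1", {"stage": "customer"})

# Review what the agent would have changed; the dummy data is untouched.
for change in store.planned_changes:
    print(change)
```

In this sketch, going live would just mean creating the store with `dry_run=False`, so the agent's logic stays exactly the same between the test run and the real one.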
Just food for thought, but this would make me much more confident using agents for critical workflows.