I've seen that agents can build real, working, Stripe integrations. But not necessarily correct ones. I've seen it do non-idempotent database operations in webhook handlers and also call Stripe APIs as a side-effect in synchronous flows.
Agents grind until the integration works in the happy path, but the devil is in the details as always.