Just wait until application designers start creating machine-hostile interfaces to keep vision models from recognizing what to click, or introducing rate limits on UI interactions. There is a strong desire to control how one's product (app, website, or whatever) is used.