I'm just wondering why multiple parties seem reluctant to allow the public to use this.
Basically, multi-modal functionality should be an OOM increase in compute, traffic, and storage requirements for anyone providing it compared to a text-only model (or an only-text-allowed model).
Anyone who acts all casual, as if there is not a constellation of vultures circling AI right now should consider themselves 'off-grid'
1. --- It will kill captchas for good. Half of the internet is protected by Cloudflare or Google captchas at this point. Spam, fraud, and other trouble has a maximum possible volume because you can only pay a human in India so little to solve them for you. If you have an algorithm that can complete it, the game is up. Sites may as well not have a captcha at all. Prevention then becomes much more Orwellian with hardware TPM attestation solutions and the internet as we know is forever changed.
2. --- It will show corporations and governments just how all-seeing video surveillance could be. Human or (by some reports, above-human) level computer vision is a Pandora's box all by itself.
OpenAI might simply be wanting to avoid opening any more family-size cans of worms than there already are.
We started with GPT API but switched to Vertex AI due to speed. We will still use GPT API as backup still though.
1. text-bison limited to 1024 output tokens.
2. Output format we ask for JSON. But it is not valid json many times (, after last element, missing } after element etc). We have to write our own parsing code in the end to work around these JSON format issues.
Edit: sorry, that was a different experiment. The one that worked well was an address splitter, trained off Google Address Validator output, funnily enough. Still, the output JSON got prefixed with some of the address input.
It seems we may find companies on all major cloud providers in the near future to guarantee access to unique proprietary services that cloud providers are starting to differentiate themselves with from their competitors
For GCP specifically, the Anthis/Omni stuff seems like a way to sell those services even if the infrastructure isn't actually in GCP.
Is it Azure OpenAI service (that seems too simple!)?
what i did learn, is that somehow, google has all of my credit cards despite me never sharing it on the account i was using.