Years ago it got sent to the cloud, but as long as you have an iPhone from the past few years it's on-device.
In each and every case I'm familiar with, streaming means "send the whole audio thus far to the inference engine, inference it, and send back the transcription"
I have a Flutter library that does the same flow as this (though via ONNX, so I can cover all platforms), and Whisper + Silero is ~identical to the interfaces I used at Google.
If the idea is streaming is when each audio byte is only sent once to the server, there's still an audio buffer accumulated -- its just on the server.
https://github.com/Helium314/HeliBoard
https://github.com/openboard-team/openboard
https://github.com/rkkr/simple-keyboard (guessing, since AOSP Keyboard works and this is a fork)
Not open source: https://www.microsoft.com/en-us/swiftkey
Does not have glide/swipe (reserved for symbols), but I just installed and giving it a shot: https://github.com/Julow/Unexpected-Keyboard
It does have glide typing, even.though I don't use it.
It rather uses long-tap to access multiple symbols, and can be split or pushed to a corner on devices with a big screen.
I'm very interested in using this, but I can't even find a way to try to troubleshoot it. I'm not finding usage instructions, never mind any kind of error messages. It just doesn't do anything.
This is especially interesting to me because the screenshot on the repo is from Vanadium, which strongly suggests to me that it's from a GrapheneOS device itself.
The thing I'm tripping over now is just that I keep pressing the button more than once when I'm done speaking because it's not clear that it registered the first time. If it could even just stay "pressed" or something while it processes the text, I think that would make it clearer. Any third state for the button would do I think.
Looking forward to using this! Thanks!
I would pay for an app that did this.
This is an unaffiliated version looks like https://apps.apple.com/us/app/live-transcribe/id1471473738
I understand that the author trusts itself more than F-Droid, but as a user the opposite seems more relevant.
I see the features listed[0] which seems like a reasonable feature set, but nothing unusual afaict.
If there has been a lot of hype can you tell me what people find compelling about it?