Email - ashray@rephrase.ai
To understand our product-market fit, we need to know whether viewer retention for our videos improves after our processing (we do this for advertisers, so better retention means higher returns for them). I was looking for tools that can track how "attentive" a person is from a webcam feed, but I could not find anything good.
The best resource I found was https://www.realeyesit.com but they are asking us for 17.5k USD (which is obviously unaffordable for a pre-product startup). Some other resources like https://xlabsgaze.com are highly inaccurate.
Do open-source tools exist for estimating video engagement from a webcam? Alternatively, are there more accessible tools for this purpose (under $100 per month)?
Even something as simple as a continuous background voice-to-text converter, plus a text-to-voice converter for when she wants to "talk", should be much better for her than her current situation (my assumption is that she always has a laptop or mobile screen in front of her).
This should be as simple as integrating voice-to-text and text-to-voice APIs from any provider (say Google Cloud, assuming internet connectivity isn't a problem).
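A minimal sketch of such a loop, assuming audio arrives in chunks and replies are typed on the device. The `transcribe` and `speak` callables are hypothetical placeholders for whichever voice-to-text and text-to-voice service is chosen; no specific cloud API is implied, and the stub backends below exist only so the sketch runs offline.

```python
from typing import Callable, Iterable, List, Optional


def run_assistant(
    audio_chunks: Iterable[bytes],
    transcribe: Callable[[bytes], str],
    speak: Callable[[str], None],
    typed_replies: Iterable[Optional[str]],
) -> List[str]:
    """Caption incoming audio and voice a reply whenever one is typed.

    `transcribe` and `speak` are hypothetical wrappers around a
    voice-to-text and a text-to-voice service respectively.
    """
    captions: List[str] = []
    for chunk, reply in zip(audio_chunks, typed_replies):
        text = transcribe(chunk)
        if text:  # skip silent chunks that produce no transcript
            captions.append(text)
        if reply:  # the user typed something to say out loud
            speak(reply)
    return captions


def fake_transcribe(chunk: bytes) -> str:
    # Stub backend: treats the audio bytes as if they were already text.
    return chunk.decode("utf-8")


spoken: List[str] = []  # stub "speaker": records what would be voiced
captions = run_assistant(
    audio_chunks=[b"hello", b"", b"how are you?"],
    transcribe=fake_transcribe,
    speak=spoken.append,
    typed_replies=[None, None, "I am fine"],
)
print(captions)  # ['hello', 'how are you?']
print(spoken)    # ['I am fine']
```

Keeping the two services behind plain callables means the provider can be swapped without touching the loop; a real version would also need microphone capture and an on-screen caption display, which are left out here.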
This seems incredibly basic, and I assume systems like this already exist! If not, I could hack one together over a weekend. Could you please point me to any existing tools you know of that solve this problem for people with hearing problems?