What the OCR market needs is someone who will bring that level of OCR quality - or better - to the masses (perhaps some deep learning grad student with time to kill?), not yet another wrapper around Tesseract. We have those already!
Is this better than https://ocr.space ?
For my private documents I would always use offline OCR software like http://blog.a9t9.com/p/free-ocr-software.html
What's the privacy model? While the PDFs are deleted, what happens to the searchable content? Is it also deleted?
What's the revenue model? How can we be sure it'll be around in a few months?
Is there an AJAX interface?
Is the quality or performance better than running Tesseract on a server?
Thoughts?
BTW, how is this news?
It comes down to how many people agree it's interesting by upvoting :)
Don't know if the OCR function is available in the reader version.
Either way, super cool idea. My dad will be stoked about this as he's been OCR'ing his way into oblivion for the past few years.