Not really. The model is good/fast at OCR, and preprocessing it actually makes it worse because academic paper formatting is very complicated. Sizes, positions, and equations are important.
what a strange world we live in where robots are WORSE at handling formatted stuff. I wonder what this means for the importance of semantic HTML to screenreaders