Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
cpursley
1y ago
0 comments
Share
Any tips on effectively getting financial data out of PDFs into a RAG system (especially data contained in tables)? And locally, not via proprietary cloud PDF parsing thingy. That's the current nut I'm trying to crack.
0 comments
default
newest
oldest
rawsh
1y ago
https://github.com/VikParuchuri/marker
is solid, but slow and needs gpu(s) to be practical
serjester
1y ago
You might find my library useful -
https://github.com/Filimoa/open-parse
j
/
k
navigate · click thread line to collapse