1. Linux Running in a PDF (doompdf.dev)
114 points by theden 4 hours ago | 37 comments
2. Ingesting PDFs and why Gemini 2.0 changes everything (sergey.fyi)
978 points by serjester 17 hours ago | 323 comments
As people start bolting various kinds of PDF parsers and evaluators to LLMs, there's got to be some interesting hack potential.
Oh... I guess if you can somehow have it trigger a "load an image with this query string" or something, that could be a way to communicate with the rest of the world.
And OFC there's a chess engine in PS, a tic tac toe, and with patience you could even play NES games, but you might need to play with the stack a lot.
ClamAV: Js.Trojan.Obfus-48
Cylance: Unsafe
Google: Detected
Not sure if there's a reason for that, like Chrome allowing more code execution within a document or something?
  gs \
    -dNOPAUSE \
    -sDEVICE=pdfwrite \
    -sOUTPUTFILE=clean.pdf \
    -dBATCH \
    dirty.pdf
[0]: https://tex.stackexchange.com/a/481609/29430
At a minimum, you'd have to sandbox it using something like gVisor.
How would you structure your workflow to protect from potentially malicious PDFs?
I had originally thought of setting up an inotifywait watcher that would look for downloaded PDFs and swap them for cleaned versions (while leaving a *_with-risky-active-contents.pdf copy).
After thinking for a bit about your comment, I thought about creating a .desktop file that first cleans the PDF via `docker run --runtime=runsc -it ubuntu gs ...`, then launches the viewer, and is registered as the default handler for PDF documents...
But now I am wondering if this should be integrated into ClamAV and other antivirus clients (with unblocking on a case-by-case basis).
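The swap-and-keep-a-copy idea above could be sketched roughly like this. It is only an illustration: the function names `risky_copy_name` and `clean_pdf` are made up for this example, the Ghostscript flags are copied verbatim from the command quoted earlier in the thread, and it calls `gs` directly instead of inside a gVisor sandbox.

```shell
#!/bin/sh
# Hypothetical helpers, not from the thread.

# Name of the untouched backup copy:
#   report.pdf -> report_with-risky-active-contents.pdf
risky_copy_name() {
    printf '%s_with-risky-active-contents.pdf\n' "${1%.pdf}"
}

# Re-distill a PDF through Ghostscript and swap the result into place.
# The original bytes survive under the *_with-risky-active-contents.pdf name.
clean_pdf() {
    src=$1
    backup=$(risky_copy_name "$src")
    tmp="${src%.pdf}.clean.tmp.pdf"

    # Keep the original before touching anything.
    cp -- "$src" "$backup" || return 1

    # Same flags as the command quoted in the thread; gs runs unsandboxed here.
    if ! gs -dNOPAUSE -dBATCH -sDEVICE=pdfwrite \
            -sOUTPUTFILE="$tmp" "$backup"; then
        # On failure, drop the partial output and leave the original in place.
        rm -f -- "$tmp"
        return 1
    fi

    # Replace the downloaded file with the cleaned version.
    mv -- "$tmp" "$src"
}
```

Something like `clean_pdf ~/Downloads/paper.pdf` would then be the call an inotifywait loop or a .desktop wrapper makes. Writing to a temporary file and moving it into place means the viewer never sees a half-written PDF.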
https://www.alchemistowl.org/pocorgtfo/
I confess to having become a fan long, long ago ..
> Ingesting PDFs and why Gemini 2.0 changes everything
Be afraid.
Be very afraid.
What one shouldn't do is use any of that for "serious" purposes, but that kind of stuff is a part of what makes computing great - boundaries are in the imagination.
I thought it was the ease and leisure as well as the aid in pursuit of real knowledge they were supposed to introduce into our lives.
Can we spend our time more wisely instead?
"Please wait... If this message is not eventually replaced by the proper contents of the document, your PDF viewer may not be able to display this type of document."
... because they pull down their actual contents using JS from some website. They are the antithesis of what a PDF file is meant to be. Truly the worst of both worlds and a huge step backwards in accessibility and longevity. All that a screen reader can read is the above text.
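A crude way to spot files like that before opening them is to grep for the PDF name objects that usually signal scripted or form-driven content. This is a minimal sketch, assuming a grep that supports `-a` (GNU and BSD grep do). The function name `has_active_content` is hypothetical, and including `/XFA` rests on my assumption that the "Please wait..." placeholder typically comes from XFA forms. It will miss names hidden inside compressed object streams or written with `#xx` escapes, so a negative result proves nothing.

```shell
#!/bin/sh
# Hypothetical heuristic, not a real scanner.

# Succeeds (exit 0) if the file contains a name object commonly tied to
# active content: /JavaScript, /JS, /OpenAction, /Launch or /XFA.
# The trailing group keeps longer names that merely start with /JS from matching.
has_active_content() {
    LC_ALL=C grep -a -q -E \
        '/(JavaScript|JS|OpenAction|Launch|XFA)([^A-Za-z0-9]|$)' -- "$1"
}
```

Usage would be along the lines of `has_active_content form.pdf && echo "form.pdf: active content markers found"`.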
If yes, Adobe has this friendly AI assistant forced in your face, plus floating toolbars overlapping all sides of your document that you cannot get rid of to get a clean view of the document itself.
So your dream of a simple, lightweight, clutter-free PDF reader will remain a dream, unfortunately.
[0] https://www.adobe.com/devnet-docs/acrobatetk/tools/ReleaseNo...
[1] https://www.adobe.com/devnet-docs/acrobatetk/tools/ReleaseNo...
Then again, I'm in no way running a successful billion-dollar software company, so what do I know?