undefined | Better HN

0 pointspzo4mo ago0 comments

yeah I feel the same, I think even having a screenshot of part of rendered page or full page can be useful even for machines considering how heavy those HTML can be to parse and expensive for LLM context. Sometimes (sub)screenshot is just a better kind of compression

0 comments

fbouvier4mo ago

Yes HTML is too heavy and too expensive for LLM. We are working on a text-based format more suitable for AI.

httpteapot4mo ago

What do you think of the DeepSeek OCR approach where they say that vision tokens might better compress a document than its pure text representation?

https://news.ycombinator.com/item?id=45640594

I've spent some time feeding llm with scrapped web pages and I've found that retaining some style information (text size, visibility, decoration image content) is non trivial.

fbouvier4mo ago

Keeping some kind of style information is definitely important to understand the semantics of the webpage.

j / k navigate · click thread line to collapse

0 comments

fbouvier4mo ago

Yes HTML is too heavy and too expensive for LLM. We are working on a text-based format more suitable for AI.

httpteapot4mo ago

What do you think of the DeepSeek OCR approach where they say that vision tokens might better compress a document than its pure text representation?

https://news.ycombinator.com/item?id=45640594

I've spent some time feeding llm with scrapped web pages and I've found that retaining some style information (text size, visibility, decoration image content) is non trivial.

fbouvier4mo ago

Keeping some kind of style information is definitely important to understand the semantics of the webpage.

j / k navigate · click thread line to collapse